Integrin β subunit and uses thereof

ABSTRACT

The present invention provides substantially pure integrins containing a novel β subunit designated as β 6 . The novel β 6  subunit forms heterodimers with α V  and α F . Methods of controlling cell adhesion using the B 6  -containing integrins are also provided.

This work was supported in part by research grants HL/AL 33259, CA-47541 and CA-47858 from the National Institutes of Health. The U.S. Government has rights in the invention.

BACKGROUND OF THE INVENTION BACKGROUND ART

This invention relates to receptors for adhesion peptides, and more specifically to a novel receptor subunit having affinity for extracellular matrix molecules.

Multicellular organisms, such as man, have some 10¹⁴ cells which can be divided into a minimum of fifty different types, such as blood cells and nerve cells. During the course of growth and development, cells adhere to other cells, or to extracellular materials, in specific and orderly ways. Such cell adhesion mechanisms appear to be of importance in mediating patterns of cellular growth, migration and differentiation, whereby cells develop specialized characteristics so as to function as, for example, muscle cells or liver cells. Cell adhesion mechanisms are also implicated in dedifferentiation and invasion, notably where cells lose their specialized forms and become metastasizing cancer cells.

The mechanisms underlying the interactions of cells with one another and with extracellular matrices are not fully understood, but it is thought that they are mediated by cell surface receptors which, specifically recognize and bind to a cognate ligand on the surface of cells or in the extracellular matrix.

The adhesion of cells to extracellular matrices and their migration on the matrices is mediated in many cases by the binding of a cell surface receptor to an Arg--Gly--Asp containing sequence in the matrix protein, as reviewed in Ruoslahti and Pierschbacher, Science 238:491-497 (1987). The Arg--Gly--Asp sequence is a cell attachment site at least in fibronectin, vitronectin, fibrinogen von Willibrand, thrombopondin, osteopontin, and possibly various collagens, laminin and tenascin. Despite the similarity of their cell attachment sites, these proteins can be recognized individually by their interactions with specific receptors.

The integrins are a large family of cell surface glycoproteins that mediate cell-to-cell and cell-to-matrix adhesion as described, for example, in the Ruoslahti and Pierschbacher article cited above. All known members of this family of adhesion receptors are heterodimers consisting of an α and a β subunit noncovalently bound to each other. When the integrin family was first identified, integrins were grouped into three subfamilies based on the three β subunits that were initially recognized (β₁, β₂ and β₃). Over the past few years, the primary structures of three integrin β subunits from mammalian cells and one from Drosophila have been deduced from cDNA.

Each α subunit was thought to associate uniquely with a single β subunit. Eleven distinct α subunits have thus far been described. As new integrins have been identified, however, it has become clear that this grouping is not entirely satisfactory, since there are clearly more than three β subunits and since some α subunits can associate with more than one β subunit as described, for example, in Sonnenberg et al., J. Biol. Chem. 265:14030-14038 (1988).

Because of the importance of integrins in mediating critical aspects of both normal and abnormal cell processes, a need exists to identify and characterize different integrins. The present invention satisfies this need and provides related advantages as well.

SUMMARY OF THE INVENTION

The present invention relates to a substantially purified β subunit of an integrin cell surface receptor designated as β₆. The amino acid sequence of human β₆ (SEQ ID NO:27) is provided in FIG. 3.

The present invention also relates to amino acid fragments specific to β₆ that have a variety of uses. The invention further relates to vectors having a gene encoding such fragments. Host cells containing such vectors are also provided. The nucleic acids encoding β₆ as well as nucleic acids that specifically hybridize with the nucleic acids encoding β₆ sequences are other aspects of the present invention.

In a further aspect, the present invention relates to a substantially purified integrin comprising β₆ bound to an α subunit, particularly α_(V) or α_(F). Methods of blocking the attachment of the β₆ -ccntaining integrins to its ligand and of detecting the binding of such integrins to its ligand are also provided.

The present invention also relates to methods of increasing or decreasing cell adhesion in cells expressing a β₆ -containing integrin by overexpresssing the integrin or by binding the integrin with a ligand, such as vitronectin.

BRIEF DESCRIPTION OF THE FIGURES

FIG. 1 shows the design of PCR primers (SEQ ID NOS:1 through 5, 7 and 9).

FIG. 2 shows a map of sequencing strategy.

FIG. 3 shows the nucleotide sequence and amino acid translation for human (H) (SEQ ID NOS:26 and 27) and guinea pig (GP) (SEQ ID NOS:28 and 29) β₆.

FIG. 4 shows the alignment of β₆ (SEQ ID NO:27) with four previously reported integrin β subunits (SEQ ID NOS:30 through 33).

FIG. 5 shows the alignment of partial nucleotide and amino acid sequences from human (H) and guinea pig (GP) β₁, (SEQ ID NOS:34 through 37), β₃ (SEQ ID NOS:38 through 41), and β₆ (SEQ ID NOS:26 and 29) for the region just downstream from the β3F primer.

DETAILED DESCRIPTION OF THE INVENTION

The present invention provides a composition of matter relating to a novel, substantially purified integrin β subunit, referred to herein as β₆. The amino acid sequence of β₆ for human (SEQ ID NO:27) and for guinea pig (SEQ ID NO:29) are also provided and are shown in FIG. 3.

By "substantially purified" is meant substantially free of contaminants normally associated with a native or natural environment.

By "β₆ " is meant a polypeptide having substantially the same amino acid sequence and binding functions of the polypeptides encoded by the sequences set forth in FIG. 3 for human (SEQ ID NO:26) and guinea pig (SEQ ID NO:28) β₆. Thus, modified amino acid sequences that do not substantially destroy the functions and retain the essential sequence of β₆ are included within the definition of β₆. Amino acid sequences, such as the sequence for β₁ (SEQ ID NO:30), (SEQ ID NO:31) and β₃ (SEQ ID NO:32), having less than 50% homology with the sequence of β₆, are not substantially the same sequence and, therefore, do not fall within the definition of β₆. Given the amino acid sequences set forth herein, additions, deletions or substitutions can be made and tested to determine their effect on the function of β₆. In addition, one skilled in the art would recognize that certain amino acids, such as the conserved cystines, for example, can be modified to alter a binding function of β₆.

Amino acids are identified herein by the standard one-letter abbreviations, as follows:

    ______________________________________            Amino Acid                     Symbol     ______________________________________            Alanine  A            Asparagine                     N            Aspartic acid                     D            Arginine R            Cysteine C            Glutamine                     Q            Glutamic acid                     E            Glycine  G            Histidine                     H            Isoleucine                     I            Leucine  L            Lysine   K            Methionine                     M            Phenylalanine                     F            Proline  P            Serine   S            Threonine                     T            Tryptophan                     W            Tyrosine Y            Valine   V     ______________________________________

Based on its amino acid sequence, the β subunit of the present invention is clearly different from β₁, β₂, β₃ and other β subunits that have recently been discovered. For example, the 11-amino acid carboxyl-terminal extension on β₆ distinguishes it from β₁, β₂, and β₃. The short cytoplasmic tails of β₁, β₂, and β₃ are thought to be sites of interaction with the cytoskeleton and regions for the transduction of signals initiated by interactions of the large extracellular domains with ligands. These cytoplasmic tails may also be targets for regulation of integrin function. The distinctive 11-amino acid cytoplasmic tail of β₆ indicates that its regulation or pathways for signal transduction may be different from those of β₁, β₂ and β₃.

In addition to β₁, β₂ and β₃, recent studies have suggested the existence of as many as five other integrin β subunits. A β subunit with a molecular weight of approximately 210,000 (β₄) has been found associated with the integrin a subunit "α₆ " in colon carcinoma cells and in a variety of other tumor cells of epithelial origin as described, for example, in Kajiji et al., EMBO J., 8:673-680 (1989). On the basis of its high molecular weight, 210,000 compared with the predicted size of 106,000 of the subject novel protein, and on the basis of its clearly different amino-terminal sequence, it is apparent that β₄ is not the same as the subject polypeptide.

Another β subunit, originally called β_(x) was identified in epithelial-derived tumor cells in association with the integrin α subunit α_(v) as described, for example, in Cheresh et al., Cell 57:59-69 (1989). This β subunit, having a distinctive amino-terminal sequence, was recently renamed β₅. Based on recent studies of purified preparations, β₅ clearly differs from the β subunit of the present invention. Because the β subunit described in the present report is distinct from each of the five β subunits for which sequence information is available, it has been designated as β₆.

The existence of two other integrin β subunits has been inferred from the identification of unique proteins after immunoprecipitation of surface-labeled cell lysates with antibodies to known α subunits. One of these novel proteins, called β_(S) was found in association with α_(v) in the human osteosarcoma cell line MG-63, in the fibroblast cell line AF1523, and in human endothelial cells as described, for example, in Freed et. al., EMBO J. 8:2955-2965 (1989). This subunit is also different from β₆ since β_(S) is expressed in MG-63 cells while β₆ is not expressed in these cells as shown in Table 1.

The other novel integrin β subunit identified by co-immunoprecipitation of known α subunits, B_(p), is a protein of about M_(r) 95,000 that is found to be associated with α₄ an α subunit first found as part of the lymphocyte homing receptor VLA-4 as described, for example, in Holzmann et al., Cell 45:37-46 (1989). This subunit is also distinct from β₆ since β_(P) is expressed in lymphocytes while β₆ is not expressed in lymphocytes as shown in Table 1.

                  TABLE 1     ______________________________________     Distribution of β.sub.6            Type     Results    Source     ______________________________________     Cell Lines:     FG-2     Pancreatic +          Kajiji et al.,                                    EMBO J.                                    3:673-80 (1989)     Panc I   Pancreatic -          Dr. Metzgar,                                    Duke U., N. C.     Colo-396 Colon CA   +          Dr. L. Walker,                                    Cytel, San                                    Diego, CA     UCLA P3  Lung CA    +          Dr. L. Walker,                                    Cytel, San                                    Diego, CA     HeLa     Cervical   -          ATCC #CCL-2     Jar      Chorio CA  +          ATCC #HTB 36     HT 1080  Fibrosarcoma                         -          ATCC #CCL 121     U 937    Monocytoid -          ATCC #CRL 1593     M 21     Melanoina  -          Dr. R.                                    Reisfeld,                                    Scripps Clinic                                    & Research                                    Foundation,                                    La Jolla, CA     B 16     Melanoma   -          Dr. R.                                    Reisfeld                                    Scripps Clinic                                    & Research                                    Foundation,                                    La Jolla, CA     MG 63    Osteosarcoma                         -          ATCC #CRL 1427     Tissues:              Cervix     +              Aortic Endothelium                         -              Leukocytes -     ______________________________________

The invention also provides an integrin comprising β₆ bound to an a subunit. β₆, consistent with recent findings of other β subunits, can associate with a variety of α subunits to form a functional integrin. In one embodiment, β₆ associates with α_(v). In another embodiment, β₆ associates with another a subunit referred to herein as α_(F). The α_(v) β₆ integrin, as well as other integrins containing β₆, can bind molecules, for example extracellular matrix molecules. Such molecules are referred to herein as ligands. In a specific embodiment, certain β₆ -containing integrins can bind Arg--Gly--Asp-containing polypeptides such as vitronectin or fibronectin. The binding of β₆ -containing integrins to various ligands can be determined according to procedures known in the art and as described for example, in Ruoslahti and Pierschbacher, Science 238:491-497 (1987).

The invention also provides an amino acid fragment specific to β₆. Since β₆ is a novel molecule, it contains many fragments which are specific for this β subunit. Fragments specific to β₆ contain sequences having less than 50% homology with sequences of other known integrin β subunit fragments. These fragments are necessarily of sufficient length to be distinguishable from known fragments and, therefore, are "specific for β6." The amino acid sequence of such fragments can readily be determined by referring to the figures which identify the β₆ amino acid sequences. These fragments also retain the binding function of the β₆ subunit and can therefore be used, for example, as immunogens to prepare reagents specific for β₆ or as an indicator to detect the novel β₆ -containing integrin of the present invention. One skilled in the art would know of other uses for such fragments.

The invention also provides a reagent having specificity for an amino acid sequence specific for β₆. Since β₆ is a novel protein with at least 50% amino acid differences over related β subunits, one skilled in the art could readily make reagents, such as antibodies, which are specifically reactive with amino acid sequences specific for β₆ and thereby immunologically distinguish β₆ from other molecules. Various methods of making such antibodies are well established and are described, for example, in Antibodies, A Laboratory Manual, E. Harlow and D. Lane, Cold Spring Harbor Laboratory 1988, pp. 139-283 and Huse et al., Science 24:1275-1280 (1988).

The invention also provides nucleic acids which encode β₆. Examples of such sequences are set forth in FIG. 3 (SEQ ID NOS:26 and 28). Following standard methods as described, for example, in Maniatis et al., Molecular Cloning, Cold Spring Harbor (1982), nucleic acid sequences can be cloned into the appropriate expression vector. The vector can then be inserted into a host, which will then be capable of expressing recombinant proteins. Thus, the invention also relates to vectors containing nucleic acids encoding such sequences and to hosts containing these vectors.

The sequences set forth in FIG. 3 (SEQ ID NOS:26 and 28) also provide nucleic acids that can be used as probes for diagnostic purposes. Such nucleic acids can hybridize with a nucleic acid having a nucleotide sequence specific for β₆ but do not hybridize with nucleic acids encoding non-β₆ proteins, particularly other cell surface receptors. These nucleic acids can readily be determined from the sequence of β₆ and synthesized using a standard nucleic acid synthesizer. Nucleic acids are also provided which specifically hybridize to either the coding or non-coding DNA of β₆.

Integrin cell surface receptors bind ligands, such as extracellular matrix molecules. However, the binding of the integrin to the ligand can be blocked by various means. For example, the binding of a β₆ -containing integrin can be blocked by a reagent that binds the β₆ subunit or the β₆ -containing integrin. Examples of such reagents include, for example, Arg--Gly--Asp-containing peptides and polypeptides, ligand fragments containing the integrin binding site, as well as antibodies specifically reactive with β₆ or a β₆ -containing integrin. Alternatively, the blocking can be carried out by binding the ligand or fragment thereof, recognized by a β₆ -containing integrin with a reagent specific for the ligand at a site that inhibits the ligand from binding with the integrin. Since the binding of a β₆ -containing integrin to its ligand can mediate cell adhesion to an extracellular matrix molecule, preventing this binding can prevent cell adhesion. Alternatively, cell adhesion can be promoted by increasing the expression of β₆ -containing integrins by a cell.

Finally, the invention provides a method of detecting ligands which bind a β₆ -containing integrin. The method comprises contacting a β₆ -containing integrin with a solution containing ligands suspected of binding β₆ -containing integrins. The presence of ligands which bind a β₆ -containing integrin is then detected.

The following examples are intended to illustrate but not limit the invention.

EXAMPLE I Identification of a Novel β Subunit

A. Generation of cDNA Fragments by Polymerase Chain Reaction

Tracheal epithelial cells, harvested from male Hartley outbred guinea pigs (Charles River Breeding Laboratories, Bar Harbor, Me.) were grown to confluence over 10-14 days on collagen-impregnated microporous filters commercially available from Costar. RNA was harvested from these primary cultures, and mRNA was purified over oligo(dT)-cellulose columns using he Fast Track mRNA isolation kit (Invitrogen, San Diego, Calif.). Two to 5 μg of mRNA was used as a template for cDNA synthesis catalyzed by 200 units of Moloney murine leukemia virus reverse transcriptase (Bethesda Research Laboratories, Gaithersburg, Md.) in a 20-40 μl reaction volume. One to 5 μl of the resultant cDNA was used as a template for polymerase chain reaction (PCR). PCR was carried out in a reaction volume of 25-200 μl. In addition to the template cDNA, each PCR reaction contained 50 mM KCl, 10 mM Tris-HCl (pH 9.0 at 25° C.), 1.5 mM MgCl₂, 0.01% gelatin, 0.1% Triton X-100, 0.2 mM each of dATP, dGTP, dCTP and dTTP, and 0.05 units/μl Taq DNA polymerase (obtained from either United States Biochemical Corporation, Cleveland, Ohio, or from Promega, Madison, Wis.).

For each reaction, two olhgonucleotide primers were also added to obtain a final concentration of 1 μM each. The primer pairs are identified below. Each reaction mixture was overlaid with mineral oil, heated to 95° C. for 4 min. in a thermal cycler (Ericomp, San Diego, Calif.), and then subjected to 30 cycles of PCR. Each cycle consisted of 45 seconds at 95° C., 45 seconds at 53° C., and 1 min. at 72° C. Immediately after the last cycle, the sample was maintained at 72° C. for 10 min.

The results of each PCR reaction were analyzed by gel electrophoresis in 1.5% agarose. Reactions that produced fragments of the expected size were electrophoresed in 1.5% low gel temperature agarose (Bio-Rad Laboratories, Richmond, Calif.). The appropriate size band was excised, melted at 68° C., and the DNA was purified by extraction with phenol/chloroform and precipitation in ethanol and ammonium acetate.

B. PCR Primers

To obtain the initial fragment of the novel β subunit cDNA described herein, degenerate mixtures of PCR primers were used. Oligonucleoticdes were synthesized, trityl-on, by the University of California, San Francisco Biomolecular Resource Center using a DNA synthesizer with standard procedures, and purified over Nen-sorb cartridges (DuPont-New England Nuclear, Boston, Mass.). These consensus primer mixtures were designed to anneal with the nucleotides encoding the highly conserved sequence Asp--Leu--Tyr--Tyr--Leu--Met--Asp--Leu (primer B1F) (SEQ ID NO:3) and Glu--Gly--Gly--Asp--Ala--Ile--Met--Gln (primer B2R) (SEQ ID NO:2) that flank an approximately 300-nucleotide region beginning approximately 130 amino acids from the amino terminus of each of the integrin β subunits sequenced to date. The sequences of the primers identified herein are depicted in FIG. 1 (SEQ ID NOS:1 through 5).

On the basis of the initial sequence obtained, a specific forward primer was designed to anneal with the sequence encoding the amino acids Pro--Leu--Thr--Asn--Asp--Ala--Glu--Arg (primer BTE2F) (SEQ ID NO:6) ending approximately 49 nucleotides from the 3' end of the region that had been sequenced. An additional forward primer (B3F) (SEQ ID NO:3) and two reverse primers (B3R and B4R) (SEQ ID NOS:4 and 5) were also designed to recognize highly conserved consensus regions encoding the sequences Gly--Glu--Cys--Val--Cys--Gly--Gln--Cys (B3 region) (SEQ ID NOS:3 and 4) and Ile--Gly--Leu--Ala--Leu--Leu--Leu--Ile--Trp--Lys (B4 region) (SEQ ID NO:5). The alignment of these primers with previously published sequences of human β₁, β₂ and β₃ and chicken β₁ is shown in FIG. 1. PCR as described above was performed with cDNA from guinea pig tracheal epithelial cells and the primer pairs BTE2F/B3R (SEQ ID NOS:7 and 4) and B3F/B4R (SEQ ID NOS:3 and 5).

The primer pair BTE2F/B3R (SEQ ID NOS:7 and 4) yielded 1095 additional base pairs of new sequence. Based on this sequence another specific primer (BTE3F) (SEQ ID NO:9) was designed to recognize the sequence Val--Ser--Glu--Asp--Gly--Val near the 3' end of this sequence, and PCR was performed with this primer in combination with primer B4R.

FIG. 1 shows the design of the PCR primers. β subunit consensus primer mixtures were designed on the basis of alignment of published sequences of human β₁, β₂, β₃ and chicken β₁. For forward primers (B1F and B3F) (SEQ ID NOS:1 and 3), the primer sequences included a single nucleotide whenever possible for each of the first two nucleotides of each codon and were usually either degenerate or included deoxyinosine for the third base in codons for amino acids other than methionine. Reverse primers (B2R, B3R, and B4R) (SEQ ID NOS:2, 3 and 5) were designed in the same manner for :he complementary DNA strand. Two specific forward primers were designed to recognize β₆. The first (BTE2F) (SEQ ID NO:7) was designed to work across species and was thus degenerate or included deoxyinosine in the third codon position. The second, BTE3F (SEQ ID NO:9), was not degenerate and was designed to only recognize guinea pig β₆.

C. Cloning of Fragments Obtained by PCR

Individual fragments were cloned in pBluescript (Stratagene, San Diego, Calif.) as follows. Purified fragments were resuspended in distilled water containing deoxynucleotides and treated with 2.5 units of DNA polymerase I, large fragment (Promega) to fill in any 3' recessed ends left after the last cycle of PCR. The 5' ends were phosphorylated with 5 units of T4 polynucleotide kinase (New England Biolabs, Beverly, Mass.). An aliquot of the above reaction mixture containing approximately 100-200 ng of DNA, was ligated into pBluescript that had been cut with EcoRV (Promega) and dephosphorylated with calf intestinal alkaline phosphatase (Boehringer Mannheim, Indianapolis, Ind.). Ligations were performed at 22° C. for 1 hour with T4 DNA ligase (Bethesda Research Laboratories). The ligation mixture was used to transform competent Escherichia coli (JM109, Clontech, San Francisco, Calif.). Plasmids containing inserts were purified using the Pharmacia miniprep lysis kit (Pharmacia LKB Biotechnology, Inc., Piscataway, N.J.) denatured in 0.3 M NaOH, further purified over spin columns containing Sephacryl S-400 (Pharmacia), and then sequenced using the Sequenase™ version 2.0 sequencing kit (United States Biochemical Corp., Cleveland, Ohio) and ³⁵ S!dATP (Amersham Corp., Arlington Heights, Ill.).

D. Library Screening

PCR fragments generated with the primer pairs B1F/B2R (SEQ ID NOS:1 and 2) and BTE3F/B4R (SEQ ID NOS:9 and 5) were uniformlly labeled with alpha- ³² P!dCTP and used as probes to screen a random-primed cDNA library and an oligo-dT-primed cDNA library. Both libraries were constructed in the plasmid pTZ18R-BstXI obtained from Invitrogen (San Diego, Calif.) from mRNA obtained from the human pancreatic carcinoma cell line FG-2. Plasmid was purified from clones found to hybridize with either region and inserts were sequenced. A portion of insert DNA from one clone was in turn labeled and used to screen the same libraries Fourteen independent overlapping clones were sequenced from both ends using primers that recognize regions of the pTZ polylinker. The regions flanking the 3' end of the putative translated region of the new β subunit were sequenced in both directions from three clones using primers constructed to recognize sequences close to the 3' end. On the basis of the initial sequences thus obtained, an additional internal sequence was obtained from clones T10, T11, T12 and T14 (FIG. 2) after digestion with specific restriction endonucleases and relegation. Three internal fragments thus generated were subcloned into pBluescript and were also sequenced in both directions. Approximately 90% of the new sequence reported was obtained from both strands of DNA, and 97% was obtained from two or more overlapping clones (FIG. 2).

FIG. 2 shows a map of the sequencing strategy. Shown are the location of clones used to obtain the partial cDNA sequence of guinea pig β₆ (clones 1F, 3L, 3N and 3Y, top) and the complete sequence of human β₆ (clones T1-T19 bottom). Also shown is the location of the translated region (Protein). The location of the transmembrane domain is shown by the letters TM. Clones shown often represent one of several identical clones. Internal sequence of clones with long inserts was obtained by restriction endonuclease digestion and relegation and by ligation of internal fragments into pBluescript. Specific restriction sites employed are shown (Hind, HindIII; Hinc, HincII; Kpn, KpnI; Pst, PstI). The direction and extent of sequencing are shown by arrows. 1109 and 1110 are the sites recognized by oligonucleotide sequencing primers. T18 and T19 each terminated in a poly(A) tail. The regions recognized by the degenerate PCR primers B1F (B1), B2R (B2), B3R/F (B3)., and B4R (B4) and the β₆ primers BTE2F (BTE2) and BTE3F (BTE3) are noted abolie the guinea pig cDNA map, kb, kilobases.

E. Nucleotide Sequence of a Novel Guinea Pig Integrin β Subunit

PCR using cDNA from guinea pig airway epithelial cells and the consensus primer mixtures B1F and B2R (FIG. 1) amplified DNA fragments with the expected size of approximately 350 nucleotides. When the fragment DNA was sequenced after cloning into pBluescript, recombinant clones each contained inserts with one of two distinct sequences. One sequence encoded a stretch of 98 amino acids that was 97% identical to the expected region of human β₁ and was therefore presumed to be guinea pig β₁. The other sequence encoded 98 amino acids that were only 53% identical to human β₁, 45% identical to human β₂, and 57% identical to human β₃ (FIG. 2, clone 1F). Both of the guinea pig sequences included the integrin β subunit consensus sequences Ser--X--Ser--Met--X--Asp--Asp--Leu and Gly--Phe--Gly--Ser--Phe--Val, and both contained the 2 cysteine residues found in this region in all known integrin β subunits. These data suggest that ones of the two sequences we obtained encoded a new member of the integrin β subunit family.

This novel sequence was extended by further PCR steps utilizing primers specific for the novel sequence (BTE2F, BTE3F) (SEQ ID NOS:7 and 9) in combination with two additional degenerate primers (B3R and B4R, see FIGS. 1, 2 and 4). With the primer pair BTE2F/B3R (SEQ ID NOS:7 and 4) two different cDNA products were obtained (3L and 3N in FIG. 2) due to an unexpected hybridization of the β3R primer with a. site 220 nucleotides further downstream (B3' in FIG. 2). The 1732-nucleotide sequence determined from these clones is shown in FIG. 3.

FIG. 3 shows the nucleotide sequences and corresponding amino acid sequences for human (H) β₆ (SEQ ID NOS:26 and 27) and guinea pig (GP) β₆ (SEQ ID NOS:28 and 29). The amino acid translation is denoted by the single letter code beneath the second nucleotide of each codon from the translated region of human β₆. For the guinea pig sequence, only amino acids that differ from the human sequence are shown. The numbers along the right-hand margin denote the nucleotide or amino acid number of the last entry on each line. The numbering system used starts with the first nucleotide or amino acid available for each sequence shown. The nine potential sites for N-glycosylation in the putative extracellular domain of human β₆ are underlined.

F. Nucleotide Sequence of Human β₆

Screening of cDNA libraries constructed from the human pancreatic carcinoma cell line FG-2 with guinea pig cDNA probes 1F and 3Y (see FIG. 2) and subsequent screening with a probe constructed from a portion of clone T10 (FIG. 2) produced 14 independent positive clones. The two longest clones (T18 and T19) extended to the poly(A) tail. A map of these clones, constructed on the basis of sequence information and of the mobility of inserts cut out of these clones in agerose gels is shown in FIG. 2. This map predicts an mRNA of approximately 5 kilobases including at least a 226-nucleotide untranslated region at the 5' end and, a 2364-nucleotide open reading frame, and a 3' untranslated region of approximately 2.5 kilobases. This molecule has been termed integrin β₆.

FIG. 3 shows the partial nucleotide and complete amino acid sequences for human β₆ (SEQ ID NOS:26 and 27) (excluding most of the 3'-untranslated region) and the alignment of the 1732 nucleotides of sequence obtained from PCR of guinea pig airway epithelial cell cDNA. Of the 577 amino acids deduced from the region sequenced in both species only 36 residues differ; the amino acid sequences are 94% identical. Furthermore, of the 1732 nucleotides sequenced in both species, 91% are identical. Nine potential glycosylation sites present in the putative extracellular domain of human β₆ are shown by underlining. All seven of these sites that lie within the 577 amino acids obtained for guinea pig β₆ are also present in the guinea pig protein. If all of the potential glycosylation sites are occupied with oligosaccharides having an average molecular weight of 2,500, the predicted molecular weight of human β₆ would be 106,000.

Comparison of the 788-amino acid sequence deduced from the open reading frame to the three previously sequenced human β subunits (SEQ ID NOS:30 and 32) and the myospheroid protein of Drosophila (SEQ ID NO:33) is shown in FIG. 4.

FIG. 4 shows the alignment of β₆ with four previously reported integrin β subunits. Previously published sequences for human β₁ (SEQ ID NO:30), human β₂ (SEQ ID NO:31), human β₃ (SEQ ID NO:32), the myospheroid gene product (βmyo) of Drosophila (SEQ ID NO:33), and the novel sequence described as β₆ (SEQ ID NO:27) are shown using the single letter amino acid code. The 56 conserved cysteines are noted by * and the 120 other invartant amino acids by=above each line. The transmembrane domain is underlined. The regions used for constructing the consensus β subunit primers B1F (B1) (SEQ ID NO:1), B2R (B2) (SEQ ID NO:2), B3F/R (B3) (SEQ ID NOS:3 and 4), and B4R (B4) (SEQ ID NO:5) are labeled below the alignment in bold type. The numbers along the right-hand margin denote the number of the last amino acid in each line beginning from the first amino acid of each putative signal sequence.

There are 179 amino acid residues that are identical in each of the other β subunits and in β₆ including 56 conserved cysteine residues. The overall percentage of identical amino acids between β₆ and the other human β subunits is 47% for β₃, 42% or β₁ and 38% for β₂. Human β₆ is also 39% identical to the Drosophila β subunit. Human β₁, β₂ and β₃ and the Drosophila β subunit all have cytoplasmic regions consisting of 41 amino acids (beginning after the putative transmembrane domain shown by the underline in FIG. 4). Although β₆ contains each of the 10 conserved amino acid residues in this cytoplasmic region it also contains an 11-amino acid extension at the carboxyl terminus. β₆ also contains two Arg--Gly--Asp sequences, one at amino acids 514-516 and the other at 594-596. These regions could serve as recognition sites for other ligands of the integrin family.

PCR using the primer pair B3F/B4R (SEQ ID NOS:3 and 5) (see FIG. 1) amplified fragments of the expected size of approximately 750 nucleotides. Cloning and sequencing of the fragments did not result in any additional clones containing the novel β subunit sequence but did result in several clones with inserts encoding an amino acid sequence that was 97% identical to the corresponding region of human β₃ and several others encoding an amino acid sequence that was 93% identical to human β₁ (SEQ ID NO:35) (FIG. 5). These are presumably the guinea pig homologues of β1 (SEQ ID NOS:37 and 41) and β₃, respectively. The nucleotide sequences of guinea pig (SEQ ID NO:36) and human β₁ (SEQ ID NO:34), are 80% identical, and those of guinea pig (SEQ ID NO:40) and human β₃ (SEQ ID NO:38) are 91% identical.

FIG. 5 shows the alignment of partial nucleotide and amino acid sequences from human (H) and guinea pig (GP) β₁ (SEQ ID NOS:34 through 37), β₃ (SEQ ID NOS:38 through 41), and β₆ (SEQ ID NOS:26 through 29) for the region just downstream from the B3F primer (SEQ ID NO:34). Amino acid translations denoted by the one-letter code are shown below the second nucleotide of each codon. For the guinea pig sequences, only amino acids that differ from the human sequences are shown. The numbers shown along the right-hand margin denote the nucleotide number for human β₆. The sequences for human β₁ (SEQ ID NO:34) and β₃ (SEQ ID NO:38) are from previously published reports.

EXAMPLE II β₆ Associates with α_(V) And α_(F) Subunits

To determine that the novel β subunit of the present invention is associated with an α chain similar to other known integrins, antisera against peptides from the cytoplasmic domain sequence of β₆ were prepared. The following amino acid peptides from the cytoplasmic sequence of β₆ were prepared and used to immunize rabbits: RGSTSTFKNVTYKHR (residues 763-777) (SEQ ID NO:42) and YKHREKQKVDLSTDC (residues 774-788) (SEQ ID NO:43). The antisera were raised in rabbits according to standard procedures known in the art. Briefly, peptides were chemically coupled to keyhole lympet hemocyanin, and were injected in rabbits in either complete (first injection only) or incomplete Freund's adjuvant as described, for example, in Antibodies: A Laboratory Manual, E. Harlow and D. Lowe, eds., Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y. 11724. Antisera were termed 6830 (to peptides corresponding to residues 763-777) (SEQ ID NO:42) and 6341 (to peptides corresponding to residues 774-788) (SEQ ID NO:43).

The resulting polyclonal artibodies were used to immunoprecipitate detergent lysates from the pancreatic carcinoma cell line FG-2 that had been surface radioiodinated according to procedures well known in the art such as described, for example, in Kajiji et al., EMBO J. 3:673-680 (1989). A complex of two bands was precipitated of respectively 150 kilodaltons (Kd) and 97 Kd in SDS-PAGE under non-reducing conditions. Under reducing conditions, the two bands migrated as a diffused band, extending from 130 Kd to 116 Kd. These bands were specific since pre-immune serum did not precipitate any of them and they were not present when the immiunoprecipitation was carried out in the presence of the corresponding immunogenic peptide. Furthermore, the same complex of two bands was precipitated by both the 6830 and 6841 antibodies, which were raised against independent peptides from the cytoplasmic sequence deduced from β₆ cDNA clones.

To determine which of the two precipitated bands corresponds to β₆, a SDS-heat denaturated lysate from surface-radioiodinated FG-2 cells was immunoprecipitated with the 6841 antibody. Only the 97 Kd band was detectable (non-reducing conditions), identifying it as the β₆ band. Under reducing conditions, the apparent molecular weight of this band increased to 116 Kd suggesting the presence of many intra-chain disulfide bonds, which is consistent with the primary structure of β₆ and of other integrin β chains.

The other band, of 150 Kd or 130 Kd under non-reducing or reducing conditions, respectively, is likely to be an α subunit since it dissociates after SDS-heat denaturation of the lysate, indicating that it is non-covalently associated with the β₆ polypeptide. Furthermore, similar to certain other integrin α chains, its molecular weight decreases under reducing conditions by about 20 Kd (130 Kd versus 150 Kd under non-reducing conditions) probably due to a disulfide linked small peptide that dissociates upon reduction.

To identify which α chain is associated with β₆, the αβ₆ integrin complex was purified by immuno-affinity chromatography on a 6841-protein. A sepharose matrix according to procedures well known in the art such as described, for example, in Kajiji et al., EMBO J. 3:673-680 (1989). The eluted material was imnunoprecipitated with antibodies specific for α₁, α₂, α₃, α₅, α₆ and α_(V), which are known to be expressed in FG-2 cell. Only the anti-α_(V) monoclonal antibody 142.19, obtained from Dr. David Cheresh, The Scripps Research Institution, La Jolla, Calif., reacted with the purified material, which indicates that the α_(V) is associated with β₆ in this pancreatic carcinoma cell line.

To confirm this data, immunodepletion experiments on surface-radioiodinated FG-2 lysates were performed according to methods well known in the art such as described in Kajiji et al., EMBO J. 3:673-680 (1989). The cell lysate was depleted with the 6841 anti-β₆ antibody or, in parallel, with a control antiserum, and then immunoprecipitated with the 142.19 anti-α_(V) antibody. A smaller amount of α_(V) was present in the immunoprecipitation on the β₆ depleted lysate and no 97 Kd β₆ band was visible. Instead, a smaller band of about 90 Kd was present. It is hypothesized that this smaller band represents the β₅ chain also associated with α_(V) in these celLs. In the control lysate depleted with normal rabbit serum, all three bands, 150 Kd (α_(V)), 97 Kd (β₆) and 90 Kd (β₅) were present after immunoprecipitation with the anti-α_(V) 1412.19 antibody.

Another immunodepletion was carried out using 142.19 antibody as the depleting antibody, or in parallel a mouse monoclonal as a control antibody. Immunoprecipitations of α_(V) -depleted lysate with anti-α_(V) 142.19 antibodies did not show the presence of any band, indicating that all α_(V) -containing integrins had been removed. However, the 6841 anti-β₆ antibody still precipitated a complex of two bands, one corresponding to β₆, the other with a molecular weight close to that of α_(V). This α chain, however, must differ from α_(V) since it is unreactive with anti-α_(V) monoclonal antibodies and is referred to herein as α_(F). In the control depleted lysates, the 6841 anti-β₆ antibody precipitates much stronger bands, consistent with the possibility that, in FG-2 cells, two β₆ integrins exist, α_(V) β₆ and α_(F) β₆.

Although the invention has been described with reference to the presently preferred embodiment, it should be understood that various modifications can be made without departing from the spirit of the invention. Accordingly, the invention is limited only by the claims.

    __________________________________________________________________________     #             SEQUENCE LISTING     - (1) GENERAL INFORMATION:     -    (iii) NUMBER OF SEQUENCES: 43     - (2) INFORMATION FOR SEQ ID NO:1:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 23 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:     #                23TGGA CCT     - (2) INFORMATION FOR SEQ ID NO:2:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 25 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ix) FEATURE:               (A) NAME/KEY: misc.sub.-- - #difference     #"")      (B) LOCATION: replace(5,     #/standard.sub.-- name= "N=INOSINE"     -     (ix) FEATURE:               (A) NAME/KEY: misc.sub.-- - #difference     #"")      (B) LOCATION: replace(14,     #/standard.sub.-- name= "N=INOSINE"     -     (ix) FEATURE:               (A) NAME/KEY: misc.sub.-- - #difference     #"")      (B) LOCATION: replace(17,     #/standard.sub.-- name= "N=INOSINE"     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:     #               25 NCCA CCTTC     - (2) INFORMATION FOR SEQ ID NO:3:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 23 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ix) FEATURE:               (A) NAME/KEY: misc.sub.-- - #difference     #"")      (B) LOCATION: replace(3,     #/standard.sub.-- name= "N=INOSINE"     -     (ix) FEATURE:               (A) NAME/KEY: misc.sub.-- - #difference     #"")      (B) LOCATION: replace(6,     #/standard.sub.-- name= "N=INOSINE"     -     (ix) FEATURE:               (A) NAME/KEY: misc.sub.-- - #difference     #"")      (B) LOCATION: replace(10,     #/standard.sub.-- name= "N=INOSINE"     -     (ix) FEATURE:               (A) NAME/KEY: misc.sub.-- - #difference     #"")      (B) LOCATION: replace(18,     #/standard.sub.-- name= "N=INOSINE"     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:     #                23GNCA GTG     - (2) INFORMATION FOR SEQ ID NO:4:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 20 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ix) FEATURE:               (A) NAME/KEY: misc.sub.-- - #difference     #"")      (B) LOCATION: replace(6,     #/standard.sub.-- name= "N=INOSINE"     -     (ix) FEATURE:               (A) NAME/KEY: misc.sub.-- - #difference     #"")      (B) LOCATION: replace(14,     #/standard.sub.-- name= "N=INOSINE"     -     (ix) FEATURE:               (A) NAME/KEY: misc.sub.-- - #difference     #"")      (B) LOCATION: replace(18,     #/standard.sub.-- name= "N=INOSINE"     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:     # 20               ANTC     - (2) INFORMATION FOR SEQ ID NO:5:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 29 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ix) FEATURE:               (A) NAME/KEY: misc.sub.-- - #difference     #"")      (B) LOCATION: replace(6,     #/standard.sub.-- name= "N=INOSINE"     -     (ix) FEATURE:               (A) NAME/KEY: misc.sub.-- - #difference     #"")      (B) LOCATION: replace(11,     #/standard.sub.-- name= "N=INOSINE"     -     (ix) FEATURE:               (A) NAME/KEY: misc.sub.-- - #difference     #"")      (B) LOCATION: replace(15,     #/standard.sub.-- name= "N=INOSINE"     -     (ix) FEATURE:               (A) NAME/KEY: misc.sub.-- - #difference     #"")      (B) LOCATION: replace(18,     #/standard.sub.-- name= "N=INOSINE"     -     (ix) FEATURE:               (A) NAME/KEY: misc.sub.-- - #difference     #"")      (B) LOCATION: replace(24,     #/standard.sub.-- name= "N=INOSINE"     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:     #            29    CNAC AAGNCCAAT     - (2) INFORMATION FOR SEQ ID NO:6:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 24 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:     #                24CTGA AAGA     - (2) INFORMATION FOR SEQ ID NO:7:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 24 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ix) FEATURE:               (A) NAME/KEY: misc.sub.-- - #difference     #"")      (B) LOCATION: replace(3,     #/standard.sub.-- name= "N=INOSINE"     -     (ix) FEATURE:               (A) NAME/KEY: misc.sub.-- - #difference     #"")      (B) LOCATION: replace(6,     #/standard.sub.-- name= "N=INOSINE"     -     (ix) FEATURE:               (A) NAME/KEY: misc.sub.-- - #difference     #"")      (B) LOCATION: replace(9,     #/standard.sub.-- name= "N=INOSINE"     -     (ix) FEATURE:               (A) NAME/KEY: misc.sub.-- - #difference     #"")      (B) LOCATION: replace(18,     #/standard.sub.-- name= "N=INOSINE"     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:     #                24CNGA AAGA     - (2) INFORMATION FOR SEQ ID NO:8:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 17 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:     #   17             A     - (2) INFORMATION FOR SEQ ID NO:9:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 17 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:     #   17             A     - (2) INFORMATION FOR SEQ ID NO:10:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 23 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:     #                23TGGA CCT     - (2) INFORMATION FOR SEQ ID NO:11:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 23 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:     #                23TGGA CCT     - (2) INFORMATION FOR SEQ ID NO:12:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 23 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:     #                23TGGA CCT     - (2) INFORMATION FOR SEQ ID NO:13:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 23 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:     #                23TGGA CCT     - (2) INFORMATION FOR SEQ ID NO:14:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 26 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:     #              26  CCAT GATGCA     - (2) INFORMATION FOR SEQ ID NO:15:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 26 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:     #              26  CCAT CATGCA     - (2) INFORMATION FOR SEQ ID NO:16:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 26 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:     #              26  CCAT CATGCA     - (2) INFORMATION FOR SEQ ID NO:17:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 26 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:     #              26  CAAT AATGCA     - (2) INFORMATION FOR SEQ ID NO:18:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 24 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:     #                24GGCA GTGC     - (2) INFORMATION FOR SEQ ID NO:19:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 24 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:     #                24GTCA ATGT     - (2) INFORMATION FOR SEQ ID NO:20:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 24 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:     #                24GACA GTGT     - (2) INFORMATION FOR SEQ ID NO:21:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 24 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:21:     #                24GACA GTGC     - (2) INFORMATION FOR SEQ ID NO:22:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 30 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:22:     #           30     TGGT CATCTGGAAG     - (2) INFORMATION FOR SEQ ID NO:23:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 30 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:23:     #           30     TGCT CATCTGGAAA     - (2) INFORMATION FOR SEQ ID NO:24:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 30 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:24:     #           30     TGCT GATATGGAAG     - (2) INFORMATION FOR SEQ ID NO:25:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 30 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:25:     #           30     TATT GATTTGGAAA     - (2) INFORMATION FOR SEQ ID NO:26:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 2644 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: double               (D) TOPOLOGY: linear     -     (ix) FEATURE:               (A) NAME/KEY: CDS               (B) LOCATION: 227..2593     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:26:     - TAAACACAGC TTTTCTGCTT TACCTGTCCA GGTAGCCTCT GTTTTCATTT CA - #GTCTTAAT       60     - GAAAACTTTC TAACTTATAT CTCAAGTTTC TTTTCAAAGC AGTGTAAGTA GT - #ATTTAAAA      120     - TGTTATACTT CAAGAAAGAA AGACTTTAAC GATATTCAGC GTTGGTCTTG TA - #ACGCTGAA      180     #GGG ATT       235AATCG GTCTCGCACA GCAAGAACTG AAACGA ATG     #Ile            Met Gly     #                 1     - GAA CTG CTT TGC CTG TTC TTT CTA TTT CTA GG - #A AGG AAT GAT TCA CGT      283     Glu Leu Leu Cys Leu Phe Phe Leu Phe Leu Gl - #y Arg Asn Asp Ser Arg     #      15     - ACA AGG TGG CTG TGC CTG GGA GGT GCA GAA AC - #C TGT GAA GAC TGC CTG      331     Thr Arg Trp Leu Cys Leu Gly Gly Ala Glu Th - #r Cys Glu Asp Cys Leu     # 35     - CTT ATT GGA CCT CAG TGT GCC TGG TGT GCT CA - #G GAG AAT TTT ACT CAT      379     Leu Ile Gly Pro Gln Cys Ala Trp Cys Ala Gl - #n Glu Asn Phe Thr His     #                 50     - CCA TCT GGA GTT GGC GAA AGG TGT GAT ACC CC - #A GCA AAC CTT TTA GCT      427     Pro Ser Gly Val Gly Glu Arg Cys Asp Thr Pr - #o Ala Asn Leu Leu Ala     #             65     - AAA GGA TGT CAA TTA AAC TTC ATC GAA AAC CC - #T GTC TCC CAA GTA GAA      475     Lys Gly Cys Gln Leu Asn Phe Ile Glu Asn Pr - #o Val Ser Gln Val Glu     #         80     - ATA CTT AAA AAT AAG CCT CTC AGT GTA GGC AG - #A CAG AAA AAT AGT TCT      523     Ile Leu Lys Asn Lys Pro Leu Ser Val Gly Ar - #g Gln Lys Asn Ser Ser     #     95     - GAC ATT GTT CAG ATT GCA CCT CAA AGC TTG AT - #C CTT AAG TTG AGA CCA      571     Asp Ile Val Gln Ile Ala Pro Gln Ser Leu Il - #e Leu Lys Leu Arg Pro     100                 1 - #05                 1 - #10                 1 -     #15     - GGT GGT GCG CAG ACT CTG CAG GTG CAT GTC CG - #C CAG ACT GAG GAC TAC      619     Gly Gly Ala Gln Thr Leu Gln Val His Val Ar - #g Gln Thr Glu Asp Tyr     #               130     - CCG GTG GAT TTG TAT TAC CTC ATG GAC CTC TC - #C GCC TCC ATG GAT GAC      667     Pro Val Asp Leu Tyr Tyr Leu Met Asp Leu Se - #r Ala Ser Met Asp Asp     #           145     - GAC CTC AAC ACA ATA AAG GAG CTG GGC TCC GG - #C CTT TCC AAA GAG ATG      715     Asp Leu Asn Thr Ile Lys Glu Leu Gly Ser Gl - #y Leu Ser Lys Glu Met     #       160     - TCT AAA TTA ACC AGC AAC TTT AGA CTG GGC TT - #C GGA TCT TTT GTG GAA      763     Ser Lys Leu Thr Ser Asn Phe Arg Leu Gly Ph - #e Gly Ser Phe Val Glu     #   175     - AAA CCT GTA TCC CCT TTT GTG AAA ACA ACA CC - #A GAA GAA ATT GCC AAC      811     Lys Pro Val Ser Pro Phe Val Lys Thr Thr Pr - #o Glu Glu Ile Ala Asn     180                 1 - #85                 1 - #90                 1 -     #95     - CCT TGC AGT AGT ATT CCA TAC TTC TGT TTA CC - #T ACA TTT GGA TTC AAG      859     Pro Cys Ser Ser Ile Pro Tyr Phe Cys Leu Pr - #o Thr Phe Gly Phe Lys     #               210     - CAC ATT TTG CCA TTG ACA AAT GAT GCT GAA AG - #A TTC AAT GAA ATT GTG      907     His Ile Leu Pro Leu Thr Asn Asp Ala Glu Ar - #g Phe Asn Glu Ile Val     #           225     - AAG AAT CAG AAA ATT TCT GCT AAT ATT GAC AC - #A CCC GAA GGT GGA TTT      955     Lys Asn Gln Lys Ile Ser Ala Asn Ile Asp Th - #r Pro Glu Gly Gly Phe     #       240     - GAT GCA ATT ATG CAA GCT GCT GTG TGT AAG GA - #A AAA ATT GGC TGG CGG     1003     Asp Ala Ile Met Gln Ala Ala Val Cys Lys Gl - #u Lys Ile Gly Trp Arg     #   255     - AAT GAC TCC CTC CAC CTC CTG GTC TTT GTG AG - #T GAT GCT GAT TCT CAT     1051     Asn Asp Ser Leu His Leu Leu Val Phe Val Se - #r Asp Ala Asp Ser His     260                 2 - #65                 2 - #70                 2 -     #75     - TTT GGA ATG GAC AGC AAA CTA GCA GGC ATC GT - #C ATT CCT AAT GAC GGG     1099     Phe Gly Met Asp Ser Lys Leu Ala Gly Ile Va - #l Ile Pro Asn Asp Gly     #               290     - CTC TGT CAC TTG GAC AGC AAG AAT GAA TAC TC - #C ATG TCA ACT GTC TTG     1147     Leu Cys His Leu Asp Ser Lys Asn Glu Tyr Se - #r Met Ser Thr Val Leu     #           305     - GAA TAT CCA ACA ATT GGA CAA CTC ATT GAT AA - #A CTG GTA CAA AAC AAC     1195     Glu Tyr Pro Thr Ile Gly Gln Leu Ile Asp Ly - #s Leu Val Gln Asn Asn     #       320     - GTG TTA TTG ATC TTC GCT GTA ACC CAA GAA CA - #A GTT CAT TTA TAT GAG     1243     Val Leu Leu Ile Phe Ala Val Thr Gln Glu Gl - #n Val His Leu Tyr Glu     #   335     - AAT TAC GCA AAA CTT ATT CCT GGA GCT ACA GT - #A GGT CTA CTT CAG AAG     1291     Asn Tyr Ala Lys Leu Ile Pro Gly Ala Thr Va - #l Gly Leu Leu Gln Lys     340                 3 - #45                 3 - #50                 3 -     #55     - GAC TCC GGA AAC ATT CTC CAG CTG ATC ATC TC - #A GCT TAT GAA GAA CTG     1339     Asp Ser Gly Asn Ile Leu Gln Leu Ile Ile Se - #r Ala Tyr Glu Glu Leu     #               370     - CGG TCT GAG GTG GAA CTG GAA GTA TTA GGA GA - #C ACT GAA GGA CTC AAC     1387     Arg Ser Glu Val Glu Leu Glu Val Leu Gly As - #p Thr Glu Gly Leu Asn     #           385     - TTG TCA TTT ACA GCC ATC TGT AAC AAC GGT AC - #C CTC TTC CAA CAC CAA     1435     Leu Ser Phe Thr Ala Ile Cys Asn Asn Gly Th - #r Leu Phe Gln His Gln     #       400     - AAG AAA TGC TCT CAC ATG AAA GTG GGA GAC AC - #A GCT TCC TTC AGC GTG     1483     Lys Lys Cys Ser His Met Lys Val Gly Asp Th - #r Ala Ser Phe Ser Val     #   415     - ACT GTG AAT ATC CCA CAC TGC GAG AGA AGA AG - #C AGG CAC ATT ATC ATA     1531     Thr Val Asn Ile Pro His Cys Glu Arg Arg Se - #r Arg His Ile Ile Ile     420                 4 - #25                 4 - #30                 4 -     #35     - AAG CCT GTG GGG CTG GGG GAT GCC CTG GAA TT - #A CTT GTC AGC CCA GAA     1579     Lys Pro Val Gly Leu Gly Asp Ala Leu Glu Le - #u Leu Val Ser Pro Glu     #               450     - TGC AAC TGC GAC TGT CAG AAA GAA GTG GAA GT - #G AAC AGC TCC AAA TGT     1627     Cys Asn Cys Asp Cys Gln Lys Glu Val Glu Va - #l Asn Ser Ser Lys Cys     #           465     - CAC CAC GGG AAC GGC TCT TTC CAG TGT GGG GT - #G TGT GCC TGC CAC CCT     1675     His His Gly Asn Gly Ser Phe Gln Cys Gly Va - #l Cys Ala Cys His Pro     #       480     - GGC CAC ATG GGG CCT CGC TGT GAG TGT GGC GA - #G GAC ATG CTG AGC ACA     1723     Gly His Met Gly Pro Arg Cys Glu Cys Gly Gl - #u Asp Met Leu Ser Thr     #   495     - GAT TCC TGC AAG GAG GCC CCA GAT CAT CCC TC - #C TGC AGC GGA AGG GGT     1771     Asp Ser Cys Lys Glu Ala Pro Asp His Pro Se - #r Cys Ser Gly Arg Gly     500                 5 - #05                 5 - #10                 5 -     #15     - GAC TGC TAC TGT GGG CAG TGT ATC TGC CAC TT - #G TCT CCC TAT GGA AAC     1819     Asp Cys Tyr Cys Gly Gln Cys Ile Cys His Le - #u Ser Pro Tyr Gly Asn     #               530     - ATT TAT GGA CCT TAT TGC CAG TGT GAC AAT TT - #C TCC TGC GTG AGA CAC     1867     Ile Tyr Gly Pro Tyr Cys Gln Cys Asp Asn Ph - #e Ser Cys Val Arg His     #           545     - AAA GGG CTG CTC TGC GGA GGT AAC GGC GAC TG - #T GAC TGT GGT GAA TGT     1915     Lys Gly Leu Leu Cys Gly Gly Asn Gly Asp Cy - #s Asp Cys Gly Glu Cys     #       560     - GTG TGC AGG AGC GGC TGG ACT GGC GAG TAC TG - #C AAC TGC ACC ACC AGC     1963     Val Cys Arg Ser Gly Trp Thr Gly Glu Tyr Cy - #s Asn Cys Thr Thr Ser     #   575     - ACG GAC TCC TGC GTC TCT GAA GAT GGA GTG CT - #C TGC AGC GGG CGC GGG     2011     Thr Asp Ser Cys Val Ser Glu Asp Gly Val Le - #u Cys Ser Gly Arg Gly     580                 5 - #85                 5 - #90                 5 -     #95     - GAC TGT GTT TGT GGC AAG TGT GTT TGC ACA AA - #C CCT GGA GCC TCA GGA     2059     Asp Cys Val Cys Gly Lys Cys Val Cys Thr As - #n Pro Gly Ala Ser Gly     #               610     - CCA ACC TGT GAA CGA TGT CCT ACC TGT GGT GA - #C CCC TGT AAC TCT AAA     2107     Pro Thr Cys Glu Arg Cys Pro Thr Cys Gly As - #p Pro Cys Asn Ser Lys     #           625     - CGG AGC TGC ATT GAG TGC CAC CTG TCA GCA GC - #T GGC CAA GCC GGA GAA     2155     Arg Ser Cys Ile Glu Cys His Leu Ser Ala Al - #a Gly Gln Ala Gly Glu     #       640     - GAA TGT GTG GAC AAG TGC AAA CTA GCT GGT GC - #G ACC ATC AGT GAA GAA     2203     Glu Cys Val Asp Lys Cys Lys Leu Ala Gly Al - #a Thr Ile Ser Glu Glu     #   655     - GAA GAT TTC TCA AAG GAT GGT TCT GTT TCC TG - #C TCT CTG CAA GGA GAA     2251     Glu Asp Phe Ser Lys Asp Gly Ser Val Ser Cy - #s Ser Leu Gln Gly Glu     660                 6 - #65                 6 - #70                 6 -     #75     - AAT GAA TGT TTA ATT ACA TTC CTA ATA ACT AC - #A GAT AAT GAG GGG AAA     2299     Asn Glu Cys Leu Ile Thr Phe Leu Ile Thr Th - #r Asp Asn Glu Gly Lys     #               690     - ACC ATC ATT CAC AGC ATC AAT GAA AAA GAT TG - #T CCG AAG CCT CCA AAC     2347     Thr Ile Ile His Ser Ile Asn Glu Lys Asp Cy - #s Pro Lys Pro Pro Asn     #           705     - ATT CCC ATG ATC ATG TTA GGG GTT TCC CTG GC - #T ACT CTT CTC ATC GGG     2395     Ile Pro Met Ile Met Leu Gly Val Ser Leu Al - #a Thr Leu Leu Ile Gly     #       720     - GTT GTC CTA CTG TGC ATC TGG AAG CTA CTG GT - #G TCA TTT CAT GAT CGT     2443     Val Val Leu Leu Cys Ile Trp Lys Leu Leu Va - #l Ser Phe His Asp Arg     #   735     - AAA GAA GTT GCC AAA TTT GAA GCA GAA CGA TC - #A AAA GCC AAG TGG CAA     2491     Lys Glu Val Ala Lys Phe Glu Ala Glu Arg Se - #r Lys Ala Lys Trp Gln     740                 7 - #45                 7 - #50                 7 -     #55     - ACG GGA ACC AAT CCA CTC TAC AGA GGA TCC AC - #A AGT ACT TTT AAA AAT     2539     Thr Gly Thr Asn Pro Leu Tyr Arg Gly Ser Th - #r Ser Thr Phe Lys Asn     #               770     - GTA ACT TAT AAA CAC AGG GAA AAA CAA AAG GT - #A GAC CTT TCC ACA GAT     2587     Val Thr Tyr Lys His Arg Glu Lys Gln Lys Va - #l Asp Leu Ser Thr Asp     #           785     - TGC TAGAACTACT TTATGCATAA AAAAAGTCTG TTTCACTGAT ATGAAATGT - #T AATG     2644     Cys     - (2) INFORMATION FOR SEQ ID NO:27:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 788 amino               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:27:     - Met Gly Ile Glu Leu Leu Cys Leu Phe Phe Le - #u Phe Leu Gly Arg Asn     #                 15     - Asp Ser Arg Thr Arg Trp Leu Cys Leu Gly Gl - #y Ala Glu Thr Cys Glu     #             30     - Asp Cys Leu Leu Ile Gly Pro Gln Cys Ala Tr - #p Cys Ala Gln Glu Asn     #         45     - Phe Thr His Pro Ser Gly Val Gly Glu Arg Cy - #s Asp Thr Pro Ala Asn     #     60     - Leu Leu Ala Lys Gly Cys Gln Leu Asn Phe Il - #e Glu Asn Pro Val Ser     # 80     - Gln Val Glu Ile Leu Lys Asn Lys Pro Leu Se - #r Val Gly Arg Gln Lys     #                 95     - Asn Ser Ser Asp Ile Val Gln Ile Ala Pro Gl - #n Ser Leu Ile Leu Lys     #           110     - Leu Arg Pro Gly Gly Ala Gln Thr Leu Gln Va - #l His Val Arg Gln Thr     #       125     - Glu Asp Tyr Pro Val Asp Leu Tyr Tyr Leu Me - #t Asp Leu Ser Ala Ser     #   140     - Met Asp Asp Asp Leu Asn Thr Ile Lys Glu Le - #u Gly Ser Gly Leu Ser     145                 1 - #50                 1 - #55                 1 -     #60     - Lys Glu Met Ser Lys Leu Thr Ser Asn Phe Ar - #g Leu Gly Phe Gly Ser     #               175     - Phe Val Glu Lys Pro Val Ser Pro Phe Val Ly - #s Thr Thr Pro Glu Glu     #           190     - Ile Ala Asn Pro Cys Ser Ser Ile Pro Tyr Ph - #e Cys Leu Pro Thr Phe     #       205     - Gly Phe Lys His Ile Leu Pro Leu Thr Asn As - #p Ala Glu Arg Phe Asn     #   220     - Glu Ile Val Lys Asn Gln Lys Ile Ser Ala As - #n Ile Asp Thr Pro Glu     225                 2 - #30                 2 - #35                 2 -     #40     - Gly Gly Phe Asp Ala Ile Met Gln Ala Ala Va - #l Cys Lys Glu Lys Ile     #               255     - Gly Trp Arg Asn Asp Ser Leu His Leu Leu Va - #l Phe Val Ser Asp Ala     #           270     - Asp Ser His Phe Gly Met Asp Ser Lys Leu Al - #a Gly Ile Val Ile Pro     #       285     - Asn Asp Gly Leu Cys His Leu Asp Ser Lys As - #n Glu Tyr Ser Met Ser     #   300     - Thr Val Leu Glu Tyr Pro Thr Ile Gly Gln Le - #u Ile Asp Lys Leu Val     305                 3 - #10                 3 - #15                 3 -     #20     - Gln Asn Asn Val Leu Leu Ile Phe Ala Val Th - #r Gln Glu Gln Val His     #               335     - Leu Tyr Glu Asn Tyr Ala Lys Leu Ile Pro Gl - #y Ala Thr Val Gly Leu     #           350     - Leu Gln Lys Asp Ser Gly Asn Ile Leu Gln Le - #u Ile Ile Ser Ala Tyr     #       365     - Glu Glu Leu Arg Ser Glu Val Glu Leu Glu Va - #l Leu Gly Asp Thr Glu     #   380     - Gly Leu Asn Leu Ser Phe Thr Ala Ile Cys As - #n Asn Gly Thr Leu Phe     385                 3 - #90                 3 - #95                 4 -     #00     - Gln His Gln Lys Lys Cys Ser His Met Lys Va - #l Gly Asp Thr Ala Ser     #               415     - Phe Ser Val Thr Val Asn Ile Pro His Cys Gl - #u Arg Arg Ser Arg His     #           430     - Ile Ile Ile Lys Pro Val Gly Leu Gly Asp Al - #a Leu Glu Leu Leu Val     #       445     - Ser Pro Glu Cys Asn Cys Asp Cys Gln Lys Gl - #u Val Glu Val Asn Ser     #   460     - Ser Lys Cys His His Gly Asn Gly Ser Phe Gl - #n Cys Gly Val Cys Ala     465                 4 - #70                 4 - #75                 4 -     #80     - Cys His Pro Gly His Met Gly Pro Arg Cys Gl - #u Cys Gly Glu Asp Met     #               495     - Leu Ser Thr Asp Ser Cys Lys Glu Ala Pro As - #p His Pro Ser Cys Ser     #           510     - Gly Arg Gly Asp Cys Tyr Cys Gly Gln Cys Il - #e Cys His Leu Ser Pro     #       525     - Tyr Gly Asn Ile Tyr Gly Pro Tyr Cys Gln Cy - #s Asp Asn Phe Ser Cys     #   540     - Val Arg His Lys Gly Leu Leu Cys Gly Gly As - #n Gly Asp Cys Asp Cys     545                 5 - #50                 5 - #55                 5 -     #60     - Gly Glu Cys Val Cys Arg Ser Gly Trp Thr Gl - #y Glu Tyr Cys Asn Cys     #               575     - Thr Thr Ser Thr Asp Ser Cys Val Ser Glu As - #p Gly Val Leu Cys Ser     #           590     - Gly Arg Gly Asp Cys Val Cys Gly Lys Cys Va - #l Cys Thr Asn Pro Gly     #       605     - Ala Ser Gly Pro Thr Cys Glu Arg Cys Pro Th - #r Cys Gly Asp Pro Cys     #   620     - Asn Ser Lys Arg Ser Cys Ile Glu Cys His Le - #u Ser Ala Ala Gly Gln     625                 6 - #30                 6 - #35                 6 -     #40     - Ala Gly Glu Glu Cys Val Asp Lys Cys Lys Le - #u Ala Gly Ala Thr Ile     #               655     - Ser Glu Glu Glu Asp Phe Ser Lys Asp Gly Se - #r Val Ser Cys Ser Leu     #           670     - Gln Gly Glu Asn Glu Cys Leu Ile Thr Phe Le - #u Ile Thr Thr Asp Asn     #       685     - Glu Gly Lys Thr Ile Ile His Ser Ile Asn Gl - #u Lys Asp Cys Pro Lys     #   700     - Pro Pro Asn Ile Pro Met Ile Met Leu Gly Va - #l Ser Leu Ala Thr Leu     705                 7 - #10                 7 - #15                 7 -     #20     - Leu Ile Gly Val Val Leu Leu Cys Ile Trp Ly - #s Leu Leu Val Ser Phe     #               735     - His Asp Arg Lys Glu Val Ala Lys Phe Glu Al - #a Glu Arg Ser Lys Ala     #           750     - Lys Trp Gln Thr Gly Thr Asn Pro Leu Tyr Ar - #g Gly Ser Thr Ser Thr     #       765     - Phe Lys Asn Val Thr Tyr Lys His Arg Glu Ly - #s Gln Lys Val Asp Leu     #   780     - Ser Thr Asp Cys     785     - (2) INFORMATION FOR SEQ ID NO:28:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 1732 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: double               (D) TOPOLOGY: linear     -     (ix) FEATURE:               (A) NAME/KEY: CDS               (B) LOCATION: 1..1731     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:28:     - TCC GCC TCC ATG GAC GAT GAC CTC AAC ACA AT - #C AAA GAG CTG GGC TCC       48     Ser Ala Ser Met Asp Asp Asp Leu Asn Thr Il - #e Lys Glu Leu Gly Ser     #                 15     - CTG CTT TCA AAG GAG ATG TCT AAA TTA ACT AG - #C AAC TTT AGA CTG GGC       96     Leu Leu Ser Lys Glu Met Ser Lys Leu Thr Se - #r Asn Phe Arg Leu Gly     #             30     - TTC GGC TCT TTT GTA GAA AAA CCC GTC TCC CC - #T TTT ATG AAA ACA ACA      144     Phe Gly Ser Phe Val Glu Lys Pro Val Ser Pr - #o Phe Met Lys Thr Thr     #         45     - CCA GAG GAA ATT GCC AAC CCT TGC AGT AGT AT - #T CCA TAT ATC TGC TTA      192     Pro Glu Glu Ile Ala Asn Pro Cys Ser Ser Il - #e Pro Tyr Ile Cys Leu     #     60     - CCT ACA TTT GGA TTC AAG CAC ATT CTG CCA TT - #G ACA AAT GAT GCT GAA      240     Pro Thr Phe Gly Phe Lys His Ile Leu Pro Le - #u Thr Asn Asp Ala Glu     # 80     - AGA TTC AAT GAA ATT GTG AAG AAA CAG AAA AT - #T TCT GCT AAT ATT GAC      288     Arg Phe Asn Glu Ile Val Lys Lys Gln Lys Il - #e Ser Ala Asn Ile Asp     #                 95     - AAC CCT GAA GGT GGA TTC GAC GCC ATT ATG CA - #A GCT GCT GTG TGT AAG      336     Asn Pro Glu Gly Gly Phe Asp Ala Ile Met Gl - #n Ala Ala Val Cys Lys     #           110     - GAA AAA ATT GGC TGG CGG AAT GAT TCG CTC CA - #T CTC CTA GTC TTC GTG      384     Glu Lys Ile Gly Trp Arg Asn Asp Ser Leu Hi - #s Leu Leu Val Phe Val     #       125     - AGT GAT GCC GAT TCT CAT TTT GGA ATG GAC AG - #C AAA CTG GCA GGC ATT      432     Ser Asp Ala Asp Ser His Phe Gly Met Asp Se - #r Lys Leu Ala Gly Ile     #   140     - GTC ATT CCC AAC GAT GGG CTG TGT CAC TTG GA - #C AGC AAG AAT GAA TAC      480     Val Ile Pro Asn Asp Gly Leu Cys His Leu As - #p Ser Lys Asn Glu Tyr     145                 1 - #50                 1 - #55                 1 -     #60     - TCC ATG TCA ACT GTC ATG GAA TAT CCA ACA AT - #T GGA CAA CTC ATT GAT      528     Ser Met Ser Thr Val Met Glu Tyr Pro Thr Il - #e Gly Gln Leu Ile Asp     #               175     - AAA GTG GTA CAA AAC AAT GTG TTA CTG ATC TT - #T GCT GTA ACC CAA GAA      576     Lys Val Val Gln Asn Asn Val Leu Leu Ile Ph - #e Ala Val Thr Gln Glu     #           190     - CAA GTT CCA CTA TAT GAG AAT TAT GCA AAA CT - #T ATT CCT GGA GCC ACA      624     Gln Val Pro Leu Tyr Glu Asn Tyr Ala Lys Le - #u Ile Pro Gly Ala Thr     #       205     - GTG GGG CTA CTT CAC AAG GAC TCT GGA AAC AT - #T CTC CAA CTG ATC ATC      672     Val Gly Leu Leu His Lys Asp Ser Gly Asn Il - #e Leu Gln Leu Ile Ile     #   220     - TCA GCT TAT GAA GAA CTG CGG TCT GAG GTG GA - #G CTG GAA GTA TTA GGA      720     Ser Ala Tyr Glu Glu Leu Arg Ser Glu Val Gl - #u Leu Glu Val Leu Gly     225                 2 - #30                 2 - #35                 2 -     #40     - GAT ACA GAG GGC CTC AAT CTT TCG TTC TCA GC - #T GTC TGT AAC AAT GGC      768     Asp Thr Glu Gly Leu Asn Leu Ser Phe Ser Al - #a Val Cys Asn Asn Gly     #               255     - ACT CTC TTC CCA CAC CAA AAG AAA TGC TTG CA - #C ATG AAA GTG GGA GAA      816     Thr Leu Phe Pro His Gln Lys Lys Cys Leu Hi - #s Met Lys Val Gly Glu     #           270     - ACA GCT TCA TTC AAT GTG ACT GTG AGT ATA CC - #A AAC TGT GAG AGA AAA      864     Thr Ala Ser Phe Asn Val Thr Val Ser Ile Pr - #o Asn Cys Glu Arg Lys     #       285     - AGC AGG CAT GTT ATC ATA AAG CCT GTG GGG CT - #G GGG GAC ACC CTG GAA      912     Ser Arg His Val Ile Ile Lys Pro Val Gly Le - #u Gly Asp Thr Leu Glu     #   300     - ATC CTT GTC AGC CCA GAA TGC AGC TGC GAT TG - #T CAG AAA GAA GTG GAA      960     Ile Leu Val Ser Pro Glu Cys Ser Cys Asp Cy - #s Gln Lys Glu Val Glu     305                 3 - #10                 3 - #15                 3 -     #20     - GTG AAC AGC TCC AAA TGC CAC AAT GGG AAC GG - #C TCC TAC CAG TGT GGG     1008     Val Asn Ser Ser Lys Cys His Asn Gly Asn Gl - #y Ser Tyr Gln Cys Gly     #               335     - GTG TGT GCC TGT AAC CCA GGC CAC ATG GGC CC - #T CAC TGC GAG TGT GGT     1056     Val Cys Ala Cys Asn Pro Gly His Met Gly Pr - #o His Cys Glu Cys Gly     #           350     - GAG GAC ACG CTG AGC ACA GAT TCC TGC AAG GA - #G ACC CCA GAC CAT CCC     1104     Glu Asp Thr Leu Ser Thr Asp Ser Cys Lys Gl - #u Thr Pro Asp His Pro     #       365     - TCG TGC AGC GGA AGG GGT GAC TGC TAC TGT GG - #G CAG TGC ATC TGC CAC     1152     Ser Cys Ser Gly Arg Gly Asp Cys Tyr Cys Gl - #y Gln Cys Ile Cys His     #   380     - TTG TCT CCC TAT GGA AAC ATT TAT GGA CCT TA - #C TGC CAG TGT GAC AAT     1200     Leu Ser Pro Tyr Gly Asn Ile Tyr Gly Pro Ty - #r Cys Gln Cys Asp Asn     385                 3 - #90                 3 - #95                 4 -     #00     - TTC TCC TGT GTG AGG CAC AAA GGG CTG CTC TG - #T GGA GAT AAC GGA GAC     1248     Phe Ser Cys Val Arg His Lys Gly Leu Leu Cy - #s Gly Asp Asn Gly Asp     #               415     - TGT GAA TGT GGG GAA TGC GTG TGC AGG AGT GG - #T TGG ACC GGA GAG TAC     1296     Cys Glu Cys Gly Glu Cys Val Cys Arg Ser Gl - #y Trp Thr Gly Glu Tyr     #           430     - TGC AAC TGT ACC ACC AGC ACA GAC ACC TGC AT - #C TCC GAA GAC GGC ACG     1344     Cys Asn Cys Thr Thr Ser Thr Asp Thr Cys Il - #e Ser Glu Asp Gly Thr     #       445     - CTC TGC AGC GGG CGC GGG GAC TGC GTC TGT GG - #C AAG TGT GTC TGC ACG     1392     Leu Cys Ser Gly Arg Gly Asp Cys Val Cys Gl - #y Lys Cys Val Cys Thr     #   460     - AAC CCT GGA GCC TCG GGA CCC ACC TGT GAA CG - #A TGT CCT ACC TGT AGT     1440     Asn Pro Gly Ala Ser Gly Pro Thr Cys Glu Ar - #g Cys Pro Thr Cys Ser     465                 4 - #70                 4 - #75                 4 -     #80     - GAC CCC TGT AAC TCT AAA CGG AGC TGC ATT GA - #A TGC CAC CTG TCT GCA     1488     Asp Pro Cys Asn Ser Lys Arg Ser Cys Ile Gl - #u Cys His Leu Ser Ala     #               495     - GAT GGT CAG CCT GGA GAA GAA TGT GTG GAC AA - #A TGC AAA CTA GCA GGT     1536     Asp Gly Gln Pro Gly Glu Glu Cys Val Asp Ly - #s Cys Lys Leu Ala Gly     #           510     - GTG ACC ATC AGC AAA GAA GCA GAT TTC TCA AA - #G GAT AGT TCT GTT TCC     1584     Val Thr Ile Ser Lys Glu Ala Asp Phe Ser Ly - #s Asp Ser Ser Val Ser     #       525     - TGC TCC CTG CAA GGA GAA AAT GAA TGT CTT AT - #T ACA TTC CTA ATA AGT     1632     Cys Ser Leu Gln Gly Glu Asn Glu Cys Leu Il - #e Thr Phe Leu Ile Ser     #   540     - ACA GAT AAT GAG GGA AAA ACC ATC ATT CAC AA - #C ATC AGT GAA AAA GAC     1680     Thr Asp Asn Glu Gly Lys Thr Ile Ile His As - #n Ile Ser Glu Lys Asp     545                 5 - #50                 5 - #55                 5 -     #60     - TGC CCC AAA CCT CCA AAT ATT CCT ATG ATC AT - #G TTG GGG GTT TCA CTG     1728     Cys Pro Lys Pro Pro Asn Ile Pro Met Ile Me - #t Leu Gly Val Ser Leu     #               575     #           1732     Ala     - (2) INFORMATION FOR SEQ ID NO:29:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 577 amino               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:29:     - Ser Ala Ser Met Asp Asp Asp Leu Asn Thr Il - #e Lys Glu Leu Gly Ser     #                 15     - Leu Leu Ser Lys Glu Met Ser Lys Leu Thr Se - #r Asn Phe Arg Leu Gly     #             30     - Phe Gly Ser Phe Val Glu Lys Pro Val Ser Pr - #o Phe Met Lys Thr Thr     #         45     - Pro Glu Glu Ile Ala Asn Pro Cys Ser Ser Il - #e Pro Tyr Ile Cys Leu     #     60     - Pro Thr Phe Gly Phe Lys His Ile Leu Pro Le - #u Thr Asn Asp Ala Glu     # 80     - Arg Phe Asn Glu Ile Val Lys Lys Gln Lys Il - #e Ser Ala Asn Ile Asp     #                 95     - Asn Pro Glu Gly Gly Phe Asp Ala Ile Met Gl - #n Ala Ala Val Cys Lys     #           110     - Glu Lys Ile Gly Trp Arg Asn Asp Ser Leu Hi - #s Leu Leu Val Phe Val     #       125     - Ser Asp Ala Asp Ser His Phe Gly Met Asp Se - #r Lys Leu Ala Gly Ile     #   140     - Val Ile Pro Asn Asp Gly Leu Cys His Leu As - #p Ser Lys Asn Glu Tyr     145                 1 - #50                 1 - #55                 1 -     #60     - Ser Met Ser Thr Val Met Glu Tyr Pro Thr Il - #e Gly Gln Leu Ile Asp     #               175     - Lys Val Val Gln Asn Asn Val Leu Leu Ile Ph - #e Ala Val Thr Gln Glu     #           190     - Gln Val Pro Leu Tyr Glu Asn Tyr Ala Lys Le - #u Ile Pro Gly Ala Thr     #       205     - Val Gly Leu Leu His Lys Asp Ser Gly Asn Il - #e Leu Gln Leu Ile Ile     #   220     - Ser Ala Tyr Glu Glu Leu Arg Ser Glu Val Gl - #u Leu Glu Val Leu Gly     225                 2 - #30                 2 - #35                 2 -     #40     - Asp Thr Glu Gly Leu Asn Leu Ser Phe Ser Al - #a Val Cys Asn Asn Gly     #               255     - Thr Leu Phe Pro His Gln Lys Lys Cys Leu Hi - #s Met Lys Val Gly Glu     #           270     - Thr Ala Ser Phe Asn Val Thr Val Ser Ile Pr - #o Asn Cys Glu Arg Lys     #       285     - Ser Arg His Val Ile Ile Lys Pro Val Gly Le - #u Gly Asp Thr Leu Glu     #   300     - Ile Leu Val Ser Pro Glu Cys Ser Cys Asp Cy - #s Gln Lys Glu Val Glu     305                 3 - #10                 3 - #15                 3 -     #20     - Val Asn Ser Ser Lys Cys His Asn Gly Asn Gl - #y Ser Tyr Gln Cys Gly     #               335     - Val Cys Ala Cys Asn Pro Gly His Met Gly Pr - #o His Cys Glu Cys Gly     #           350     - Glu Asp Thr Leu Ser Thr Asp Ser Cys Lys Gl - #u Thr Pro Asp His Pro     #       365     - Ser Cys Ser Gly Arg Gly Asp Cys Tyr Cys Gl - #y Gln Cys Ile Cys His     #   380     - Leu Ser Pro Tyr Gly Asn Ile Tyr Gly Pro Ty - #r Cys Gln Cys Asp Asn     385                 3 - #90                 3 - #95                 4 -     #00     - Phe Ser Cys Val Arg His Lys Gly Leu Leu Cy - #s Gly Asp Asn Gly Asp     #               415     - Cys Glu Cys Gly Glu Cys Val Cys Arg Ser Gl - #y Trp Thr Gly Glu Tyr     #           430     - Cys Asn Cys Thr Thr Ser Thr Asp Thr Cys Il - #e Ser Glu Asp Gly Thr     #       445     - Leu Cys Ser Gly Arg Gly Asp Cys Val Cys Gl - #y Lys Cys Val Cys Thr     #   460     - Asn Pro Gly Ala Ser Gly Pro Thr Cys Glu Ar - #g Cys Pro Thr Cys Ser     465                 4 - #70                 4 - #75                 4 -     #80     - Asp Pro Cys Asn Ser Lys Arg Ser Cys Ile Gl - #u Cys His Leu Ser Ala     #               495     - Asp Gly Gln Pro Gly Glu Glu Cys Val Asp Ly - #s Cys Lys Leu Ala Gly     #           510     - Val Thr Ile Ser Lys Glu Ala Asp Phe Ser Ly - #s Asp Ser Ser Val Ser     #       525     - Cys Ser Leu Gln Gly Glu Asn Glu Cys Leu Il - #e Thr Phe Leu Ile Ser     #   540     - Thr Asp Asn Glu Gly Lys Thr Ile Ile His As - #n Ile Ser Glu Lys Asp     545                 5 - #50                 5 - #55                 5 -     #60     - Cys Pro Lys Pro Pro Asn Ile Pro Met Ile Me - #t Leu Gly Val Ser Leu     #               575     - Ala     - (2) INFORMATION FOR SEQ ID NO:30:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 798 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:30:     - Met Asn Leu Gln Pro Ile Phe Trp Ile Gly Le - #u Ile Ser Ser Val Cys     #                15     - Cys Val Phe Ala Gln Thr Asp Glu Asn Arg Cy - #s Leu Lys Ala Asn Ala     #            30     - Lys Ser Cys Gly Glu Cys Ile Gln Ala Gly Pr - #o Asn Cys Gly Trp Cys     #        45     - Thr Asn Ser Thr Phe Phe Gln Glu Gly Met Pr - #o Thr Ser Ala Arg Cys     #    60     - Asp Asp Leu Glu Ala Leu Lys Lys Lys Gly Cy - #s Pro Pro Asp Asp Ile     #80     - Glu Asn Pro Arg Gly Ser Lys Asp Ile Lys Ly - #s Asn Lys Asn Val Thr     #                95     - Asn Arg Ser Lys Gly Thr Ala Glu Lys Leu Ly - #s Pro Glu Asp Ile His     #           110     - Gln Ile Gln Pro Gln Gln Leu Val Leu Arg Le - #u Arg Ser Gly Glu Pro     #       125     - Gln Thr Phe Thr Leu Lys Phe Lys Arg Ala Gl - #u Asp Tyr Pro Ile Asp     #   140     - Leu Tyr Tyr Leu Met Asp Leu Ser Tyr Ser Me - #t Lys Asp Asp Leu Glu     145                 1 - #50                 1 - #55                 1 -     #60     - Asn Val Lys Ser Leu Gly Thr Asp Leu Met As - #n Glu Met Arg Arg Ile     #               175     - Thr Ser Asp Phe Arg Ile Gly Phe Gly Ser Ph - #e Val Glu Lys Thr Val     #           190     - Met Pro Tyr Ile Ser Thr Thr Pro Ala Lys Le - #u Arg Asn Pro Cys Thr     #       205     - Ser Glu Gln Asn Cys Thr Thr Pro Phe Ser Ty - #r Lys Asn Val Leu Ser     #   220     - Leu Thr Asn Lys Gly Glu Val Phe Asn Glu Le - #u Val Gly Lys Gln Arg     225                 2 - #30                 2 - #35                 2 -     #40     - Ile Ser Gly Asn Leu Asp Ser Pro Glu Gly Gl - #y Phe Asp Ala Ile Met     #               255     - Gln Val Ala Val Cys Gly Ser Leu Ile Gly Tr - #p Arg Asn Val Thr Arg     #           270     - Leu Leu Val Phe Ser Thr Asp Ala Gly Phe Hi - #s Phe Ala Gly Asp Gly     #       285     - Lys Leu Gly Gly Ile Val Leu Pro Asn Asp Gl - #y Gln Cys His Leu Glu     #   300     - Asn Asn Met Tyr Thr Met Ser His Tyr Tyr As - #p Tyr Pro Ser Ile Ala     305                 3 - #10                 3 - #15                 3 -     #20     - His Leu Val Gln Lys Leu Ser Glu Asn Asn Il - #e Gln Thr Ile Phe Ala     #               335     - Val Thr Glu Glu Phe Gln Pro Val Tyr Lys Gl - #u Leu Lys Asn Leu Ile     #           350     - Pro Lys Ser Ala Val Gly Thr Leu Ser Ala As - #n Ser Ser Asn Val Ile     #       365     - Gln Leu Ile Ile Asp Ala Tyr Asn Ser Leu Se - #r Ser Glu Val Ile Leu     #   380     - Glu Asn Gly Lys Leu Ser Glu Gly Val Thr Il - #e Ser Tyr Lys Ser Tyr     385                 3 - #90                 3 - #95                 4 -     #00     - Cys Lys Asn Gly Val Asn Gly Thr Gly Glu As - #n Gly Arg Lys Cys Ser     #               415     - Asn Ile Ser Ile Gly Asp Glu Val Gln Phe Gl - #u Ile Ser Ile Thr Ser     #           430     - Asn Lys Cys Pro Lys Lys Asp Ser Asp Ser Ph - #e Lys Ile Arg Pro Leu     #       445     - Gly Phe Thr Glu Glu Val Glu Val Ile Leu Gl - #n Tyr Ile Cys Glu Cys     #   460     - Glu Cys Gln Ser Glu Gly Ile Pro Glu Ser Pr - #o Lys Cys His Glu Gly     465                 4 - #70                 4 - #75                 4 -     #80     - Asn Gly Thr Phe Glu Cys Gly Ala Cys Arg Cy - #s Asn Glu Gly Arg Val     #               495     - Gly Arg His Cys Glu Cys Ser Thr Asp Glu Va - #l Asn Ser Glu Asp Met     #           510     - Asp Ala Tyr Cys Arg Lys Glu Asn Ser Ser Gl - #u Ile Cys Ser Asn Asn     #       525     - Gly Glu Cys Val Cys Gly Gln Cys Val Cys Ar - #g Lys Arg Asp Asn Thr     #   540     - Asn Glu Ile Tyr Ser Gly Lys Phe Cys Glu Cy - #s Asp Asn Phe Asn Cys     545                 5 - #50                 5 - #55                 5 -     #60     - Asp Arg Ser Asn Gly Leu Ile Cys Gly Gly As - #n Gly Val Cys Lys Cys     #               575     - Arg Val Cys Glu Cys Asn Pro Asn Tyr Thr Gl - #y Ser Ala Cys Asp Cys     #           590     - Ser Leu Asp Thr Ser Thr Cys Glu Ala Ser As - #n Gly Gln Ile Cys Asn     #       605     - Gly Arg Gly Ile Cys Glu Cys Gly Val Cys Ly - #s Cys Thr Asp Pro Lys     #   620     - Phe Gln Gly Gln Thr Cys Glu Met Cys Gln Th - #r Cys Leu Gly Val Cys     625                 6 - #30                 6 - #35                 6 -     #40     - Ala Glu His Lys Glu Cys Val Gln Cys Arg Al - #a Phe Asn Lys Gly Glu     #               655     - Lys Lys Asp Thr Cys Thr Gln Glu Cys Ser Ty - #r Phe Asn Ile Thr Lys     #           670     - Val Glu Ser Arg Asp Lys Leu Pro Gln Pro Va - #l Gln Pro Asp Pro Val     #       685     - Ser His Cys Lys Glu Lys Asp Val Asp Asp Cy - #s Trp Phe Tyr Phe Thr     #   700     - Tyr Ser Val Asn Gly Asn Asn Glu Val Met Va - #l His Val Val Glu Asn     705                 7 - #10                 7 - #15                 7 -     #20     - Pro Glu Cys Pro Thr Gly Pro Asp Ile Ile Pr - #o Ile Val Ala Gly Val     #               735     - Val Ala Gly Ile Val Leu Ile Gly Leu Ala Le - #u Leu Leu Ile Trp Lys     #           750     - Leu Leu Met Ile Ile His Asp Arg Arg Glu Ph - #e Ala Lys Phe Glu Lys     #       765     - Glu Lys Met Asn Ala Lys Trp Asp Thr Gly Gl - #u Asn Pro Ile Tyr Lys     #   780     - Ser Ala Val Thr Thr Val Val Asn Pro Lys Ty - #r Glu Gly Lys     785                 7 - #90                 7 - #95     - (2) INFORMATION FOR SEQ ID NO:31:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 769 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:31:     - Met Leu Gly Leu Arg Pro Pro Leu Leu Ala Le - #u Val Gly Leu Leu Ser     #                15     - Leu Gly Cys Val Leu Ser Gln Glu Cys Thr Ly - #s Phe Lys Val Ser Ser     #            30     - Cys Arg Glu Cys Ile Glu Ser Gly Pro Gly Cy - #s Thr Trp Cys Gln Lys     #        45     - Leu Asn Phe Thr Gly Pro Gly Asp Pro Asp Se - #r Ile Arg Cys Asp Thr     #    60     - Arg Pro Gln Leu Leu Met Arg Gly Cys Ala Al - #a Asp Asp Ile Met Asp     #80     - Pro Thr Ser Leu Ala Glu Thr Gln Glu Asp Hi - #s Asn Gly Gly Gln Lys     #                95     - Gln Leu Ser Pro Gln Lys Val Thr Leu Tyr Le - #u Arg Pro Gly Gln Ala     #           110     - Ala Ala Phe Asn Val Thr Phe Arg Arg Ala Ly - #s Gly Tyr Pro Ile Asp     #       125     - Leu Tyr Tyr Leu Met Asp Leu Ser Tyr Ser Me - #t Leu Asp Asp Leu Arg     #   140     - Asn Val Lys Lys Leu Gly Gly Asp Leu Leu Ar - #g Ala Leu Asn Glu Ile     145                 1 - #50                 1 - #55                 1 -     #60     - Thr Glu Ser Gly Arg Ile Gly Phe Gly Ser Ph - #e Val Asp Lys Thr Val     #               175     - Leu Pro Phe Val Asn Thr His Pro Asp Lys Le - #u Arg Asn Pro Cys Pro     #           190     - Asn Lys Glu Lys Glu Cys Gln Pro Pro Phe Al - #a Phe Arg His Val Leu     #       205     - Lys Leu Thr Asn Asn Ser Asn Gln Phe Gln Th - #r Glu Val Gly Lys Gln     #   220     - Leu Ile Ser Gly Asn Leu Asp Ala Pro Glu Gl - #y Gly Leu Asp Ala Met     225                 2 - #30                 2 - #35                 2 -     #40     - Met Gln Val Ala Ala Cys Pro Glu Glu Ile Gl - #y Trp Arg Asn Val Thr     #               255     - Arg Leu Leu Val Phe Ala Thr Asp Asp Gly Ph - #e His Phe Ala Gly Asp     #           270     - Gly Lys Leu Gly Ala Ile Leu Thr Pro Asn As - #p Gly Arg Cys His Leu     #       285     - Glu Asp Asn Leu Tyr Lys Arg Ser Asn Glu Ph - #e Asp Tyr Pro Ser Val     #   300     - Gly Gln Leu Ala His Lys Leu Ala Glu Asn As - #n Ile Gln Pro Ile Phe     305                 3 - #10                 3 - #15                 3 -     #20     - Ala Val Thr Ser Arg Met Val Lys Thr Tyr Gl - #u Lys Leu Thr Glu Ile     #               335     - Ile Pro Lys Ser Ala Val Gly Glu Leu Ser Gl - #u Asp Ser Ser Asn Val     #           350     - Val His Leu Ile Lys Asn Ala Tyr Asn Lys Le - #u Ser Ser Arg Val Phe     #       365     - Leu Asp His Asn Ala Leu Pro Asp Thr Leu Ly - #s Val Thr Tyr Asp Ser     #   380     - Phe Cys Ser Asn Gly Val Thr His Arg Asn Gl - #n Pro Arg Gly Asp Cys     385                 3 - #90                 3 - #95                 4 -     #00     - Asp Gly Val Gln Ile Asn Val Pro Ile Thr Ph - #e Gln Val Lys Val Thr     #               415     - Ala Thr Glu Cys Ile Gln Glu Gln Ser Phe Va - #l Ile Arg Ala Leu Gly     #           430     - Phe Thr Asp Ile Val Thr Val Gln Val Leu Pr - #o Gln Cys Glu Cys Arg     #       445     - Cys Arg Asp Gln Ser Arg Asp Arg Ser Leu Cy - #s His Gly Lys Gly Phe     #   460     - Leu Glu Cys Gly Ile Cys Arg Cys Asp Thr Gl - #y Tyr Ile Gly Lys Asn     465                 4 - #70                 4 - #75                 4 -     #80     - Cys Glu Cys Gln Thr Gln Gly Arg Ser Ser Gl - #n Glu Leu Glu Gly Ser     #               495     - Cys Arg Lys Asp Asn Asn Ser Ile Ile Cys Se - #r Gly Leu Gly Asp Cys     #           510     - Val Cys Gly Gln Cys Leu Cys His Thr Ser As - #p Val Pro Gly Lys Leu     #       525     - Ile Tyr Gly Gln Tyr Cys Glu Cys Asp Thr Il - #e Asn Cys Glu Arg Tyr     #   540     - Asn Gly Gln Val Cys Gly Gly Pro Gly Arg Gl - #y Leu Cys Phe Cys Gly     545                 5 - #50                 5 - #55                 5 -     #60     - Lys Cys Arg Cys His Pro Gly Phe Glu Gly Se - #r Ala Cys Gln Cys Glu     #               575     - Arg Thr Thr Glu Gly Cys Leu Asn Pro Arg Ar - #g Val Glu Cys Ser Gly     #           590     - Arg Gly Arg Cys Arg Cys Asn Val Cys Glu Cy - #s His Ser Gly Tyr Gln     #       605     - Leu Pro Leu Cys Gln Glu Cys Pro Gly Cys Pr - #o Ser Pro Cys Gly Lys     #   620     - Tyr Ile Ser Cys Ala Glu Cys Leu Lys Phe Gl - #u Lys Gly Pro Phe Gly     625                 6 - #30                 6 - #35                 6 -     #40     - Lys Asn Cys Ser Ala Ala Cys Pro Gly Leu Gl - #n Leu Ser Asn Asn Pro     #               655     - Val Lys Gly Arg Thr Cys Lys Glu Arg Asp Se - #r Glu Gly Cys Trp Val     #           670     - Ala Tyr Thr Leu Glu Gln Gln Asp Gly Met As - #p Arg Tyr Leu Ile Tyr     #       685     - Val Asp Glu Ser Arg Glu Cys Val Ala Gly Pr - #o Asn Ile Ala Ala Ile     #   700     - Val Gly Gly Thr Val Ala Gly Ile Val Leu Il - #e Gly Ile Leu Leu Leu     705                 7 - #10                 7 - #15                 7 -     #20     - Val Ile Trp Lys Ala Leu Ile His Leu Ser As - #p Leu Arg Glu Tyr Arg     #               735     - Arg Phe Glu Lys Glu Lys Leu Lys Ser Gln Tr - #p Asn Asn Asp Asn Pro     #           750     - Leu Phe Lys Ser Ala Thr Thr Thr Val Met As - #n Pro Lys Phe Ala Glu     #       765     - Ser     - (2) INFORMATION FOR SEQ ID NO:32:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 788 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:32:     - Met Arg Ala Arg Pro Arg Pro Arg Pro Leu Tr - #p Val Thr Val Leu Ala     #                15     - Leu Gly Ala Leu Ala Gly Val Gly Val Gly Gl - #y Pro Asn Ile Cys Thr     #            30     - Thr Arg Gly Val Ser Ser Cys Gln Gln Cys Le - #u Ala Val Ser Pro Met     #        45     - Cys Ala Trp Cys Ser Asp Glu Ala Leu Pro Le - #u Gly Ser Pro Arg Cys     #    60     - Asp Leu Lys Glu Asn Leu Leu Lys Asp Asn Cy - #s Ala Pro Glu Ser Ile     #80     - Glu Phe Pro Val Ser Glu Ala Arg Val Leu Gl - #u Asp Arg Pro Leu Ser     #                95     - Asp Lys Gly Ser Gly Asp Ser Ser Gln Val Th - #r Gln Val Ser Pro Gln     #           110     - Arg Ile Ala Leu Arg Leu Arg Pro Asp Asp Se - #r Lys Asn Phe Ser Ile     #       125     - Gln Val Arg Gln Val Glu Asp Tyr Pro Val As - #p Ile Tyr Tyr Leu Met     #   140     - Asp Leu Ser Tyr Ser Met Lys Asp Asp Leu Tr - #p Ser Ile Gln Asn Leu     145                 1 - #50                 1 - #55                 1 -     #60     - Gly Thr Lys Leu Ala Thr Gln Met Arg Lys Le - #u Thr Ser Asn Leu Arg     #               175     - Ile Gly Phe Gly Ala Phe Val Asp Lys Pro Va - #l Ser Pro Tyr Met Tyr     #           190     - Ile Ser Pro Pro Glu Ala Leu Glu Asn Pro Cy - #s Tyr Asp Met Lys Thr     #       205     - Thr Cys Leu Pro Met Phe Gly Tyr Lys His Va - #l Leu Thr Leu Thr Asp     #   220     - Gln Val Thr Arg Phe Asn Glu Glu Val Lys Ly - #s Gln Ser Val Ser Arg     225                 2 - #30                 2 - #35                 2 -     #40     - Asn Arg Asp Ala Pro Glu Gly Gly Phe Asp Al - #a Ile Met Gln Ala Thr     #               255     - Val Cys Asp Glu Lys Ile Gly Trp Arg Asn As - #p Ala Ser His Leu Leu     #           270     - Val Phe Thr Thr Asp Ala Lys Thr His Ile Al - #a Leu Asp Gly Arg Leu     #       285     - Ala Gly Ile Val Gln Pro Asn Asp Gly Gln Cy - #s His Val Gly Ser Asp     #   300     - Asn His Tyr Ser Ala Ser Thr Thr Met Asp Ty - #r Pro Ser Leu Gly Leu     305                 3 - #10                 3 - #15                 3 -     #20     - Met Thr Glu Lys Leu Ser Gln Lys Asn Ile As - #n Leu Ile Phe Ala Val     #               335     - Thr Glu Asn Val Val Asn Leu Tyr Gln Asn Ty - #r Ser Glu Leu Ile Pro     #           350     - Gly Thr Thr Val Gly Val Leu Ser Met Asp Se - #r Ser Asn Val Leu Gln     #       365     - Leu Ile Val Asp Ala Tyr Gly Lys Ile Arg Se - #r Lys Val Glu Leu Glu     #   380     - Val Arg Asp Leu Pro Glu Glu Leu Ser Leu Se - #r Phe Asn Ala Thr Cys     385                 3 - #90                 3 - #95                 4 -     #00     - Leu Asn Asn Glu Val Ile Pro Gly Leu Lys Se - #r Cys Met Gly Leu Lys     #               415     - Ile Gly Asp Thr Val Ser Phe Ser Ile Glu Al - #a Lys Val Arg Gly Cys     #           430     - Pro Gln Glu Lys Glu Lys Ser Phe Thr Ile Ly - #s Pro Val Gly Phe Lys     #       445     - Asp Ser Leu Ile Val Gln Val Thr Phe Asp Cy - #s Asp Cys Ala Cys Gln     #   460     - Ala Gln Ala Glu Pro Asn Ser His Arg Cys As - #n Asn Gly Asn Gly Thr     465                 4 - #70                 4 - #75                 4 -     #80     - Phe Glu Cys Gly Val Cys Arg Cys Gly Pro Gl - #y Trp Leu Gly Ser Gln     #               495     - Cys Glu Cys Ser Glu Glu Asp Tyr Arg Pro Se - #r Gln Gln Asp Glu Cys     #           510     - Ser Pro Arg Glu Gly Gln Pro Val Cys Ser Gl - #n Arg Gly Glu Cys Leu     #       525     - Cys Gly Gln Cys Val Cys His Ser Ser Asp Ph - #e Gly Lys Ile Thr Gly     #   540     - Lys Tyr Cys Glu Cys Asp Asp Phe Ser Cys Va - #l Arg Tyr Lys Gly Glu     545                 5 - #50                 5 - #55                 5 -     #60     - Met Cys Ser Gly His Gly Gln Cys Ser Cys Gl - #y Asp Cys Leu Cys Asp     #               575     - Ser Asp Trp Thr Gly Tyr Tyr Cys Asn Cys Th - #r Thr Arg Thr Asp Thr     #           590     - Cys Met Ser Ser Asn Gly Leu Leu Cys Ser Gl - #y Arg Gly Lys Cys Glu     #       605     - Cys Gly Ser Cys Val Cys Ile Gln Pro Gly Se - #r Tyr Gly Asp Thr Cys     #   620     - Glu Lys Cys Pro Thr Cys Pro Asp Ala Cys Th - #r Phe Lys Lys Glu Cys     625                 6 - #30                 6 - #35                 6 -     #40     - Val Glu Cys Lys Lys Phe Asp Arg Glu Pro Ty - #r Met Thr Glu Asn Thr     #               655     - Cys Asn Arg Tyr Cys Arg Asp Glu Ile Glu Se - #r Val Lys Glu Leu Lys     #           670     - Asp Thr Gly Lys Asp Ala Val Asn Cys Thr Ty - #r Lys Asn Glu Asp Asp     #       685     - Cys Val Val Arg Phe Gln Tyr Tyr Glu Asp Se - #r Ser Gly Lys Ser Ile     #   700     - Leu Tyr Val Val Glu Glu Pro Glu Cys Pro Ly - #s Gly Pro Asp Ile Leu     705                 7 - #10                 7 - #15                 7 -     #20     - Val Val Leu Leu Ser Val Met Gly Ala Ile Le - #u Leu Ile Gly Leu Ala     #               735     - Ala Leu Leu Ile Trp Lys Leu Leu Ile Thr Il - #e His Asp Arg Lys Glu     #           750     - Phe Ala Lys Phe Glu Glu Glu Arg Ala Arg Al - #a Lys Trp Asp Thr Ala     #       765     - Asn Asn Pro Leu Tyr Lys Glu Ala Thr Ser Th - #r Phe Thr Asn Ile Thr     #   780     - Tyr Arg Gly Thr     785     - (2) INFORMATION FOR SEQ ID NO:33:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 846 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:33:     - Met Ile Leu Glu Arg Asn Arg Arg Cys Gln Le - #u Ala Leu Leu Met Ile     #                15     - Ala Met Leu Ala Ala Ile Ala Ala Gln Thr As - #n Ala Gln Lys Ala Ala     #            30     - Lys Leu Thr Ala Val Ser Thr Cys Ala Ser Ly - #s Glu Lys Cys His Thr     #        45     - Cys Ile Gln Thr Glu Gly Cys Ala Trp Cys Me - #t Gln Pro Asp Phe Lys     #    60     - Gly Gln Ser Arg Cys Tyr Gln Asn Thr Ser Se - #r Leu Cys Pro Glu Glu     #80     - Phe Ala Tyr Ser Pro Ile Thr Val Glu Gln Il - #e Leu Val Asn Asn Lys     #                95     - Leu Thr Asn Gln Tyr Lys Ala Glu Leu Ala Al - #a Gly Gly Gly Gly Gly     #           110     - Ala Met Ser Gly Ser Ser Ser Ser Ser Tyr Se - #r Ser Ser Ser Ser Ser     #       125     - Ser Ser Phe Tyr Ser Gln Ser Ser Ser Gly Se - #r Ser Ser Ala Ser Gly     #   140     - Tyr Glu Glu Tyr Ser Ala Gly Glu Ile Val Gl - #n Ile Gln Pro Gln Ser     145                 1 - #50                 1 - #55                 1 -     #60     - Met Arg Leu Ala Leu Arg Val Asn Glu Lys Hi - #s Asn Ile Lys Ile Ser     #               175     - Tyr Ser Gln Ala Glu Gly Tyr Pro Val Asp Le - #u Tyr Tyr Leu Met Asp     #           190     - Leu Ser Lys Ser Met Glu Asp Asp Lys Ala Ly - #s Leu Ser Thr Leu Gly     #       205     - Asp Lys Leu Ser Glu Thr Met Lys Arg Ile Th - #r Asn Asn Phe His Leu     #   220     - Gly Phe Gly Ser Phe Val Asp Lys Val Leu Me - #t Pro Tyr Val Ser Thr     225                 2 - #30                 2 - #35                 2 -     #40     - Ile Pro Lys Lys Leu Glu His Pro Cys Glu As - #n Cys Lys Ala Pro Tyr     #               255     - Gly Tyr Gln Asn His Met Pro Leu Asn Asn As - #n Thr Glu Ser Phe Ser     #           270     - Asn Glu Val Lys Asn Ala Thr Val Ser Gly As - #n Leu Asp Ala Pro Glu     #       285     - Gly Gly Phe Asp Ala Ile Met Gln Ala Ile Al - #a Cys Arg Ser Gln Ile     #   300     - Gly Trp Arg Glu Gln Ala Arg Arg Leu Leu Va - #l Phe Ser Thr Asp Ala     305                 3 - #10                 3 - #15                 3 -     #20     - Gly Phe His Tyr Ala Gly Asp Gly Lys Leu Gl - #y Gly Val Ile Ala Pro     #               335     - Asn Asp Gly Glu Cys His Leu Ser Pro Lys Gl - #y Glu Tyr Thr His Ser     #           350     - Thr Leu Gln Asp Tyr Pro Ser Ile Ser Gln Il - #e Asn Gln Lys Val Lys     #       365     - Asp Asn Ala Ile Asn Ile Ile Phe Ala Val Th - #r Ala Ser Gln Leu Ser     #   380     - Val Tyr Glu Lys Leu Val Glu His Ile Gln Gl - #y Ser Ser Ala Ala Lys     385                 3 - #90                 3 - #95                 4 -     #00     - Leu Asp Asn Asp Ser Ser Asn Val Val Glu Le - #u Val Lys Glu Glu Tyr     #               415     - Arg Lys Ile Ser Ser Ser Val Glu Met Lys As - #p Asn Ala Thr Gly Asp     #           430     - Val Lys Ile Thr Tyr Phe Ser Ser Cys Leu Se - #r Asn Gly Pro Glu Val     #       445     - Gln Thr Ser Lys Cys Asp Asn Leu Lys Glu Gl - #y Gln Gln Val Ser Phe     #   460     - Thr Ala Gln Ile Gln Leu Leu Lys Cys Pro Gl - #u Asp Pro Arg Asp Trp     465                 4 - #70                 4 - #75                 4 -     #80     - Thr Gln Thr Ile His Ile Ser Pro Val Gly Il - #e Asn Glu Val Met Gln     #               495     - Ile Gln Leu Thr Met Leu Cys Ser Cys Pro Cy - #s Glu Asn Pro Gly Ser     #           510     - Ile Gly Tyr Gln Val Gln Ala Asn Ser Cys Se - #r Gly His Gly Thr Ser     #       525     - Met Cys Gly Ile Cys Asn Cys Asp Asp Ser Ty - #r Phe Gly Asn Lys Cys     #   540     - Glu Cys Ser Ala Thr Asp Leu Thr Ser Lys Ph - #e Ala Asn Asp Thr Ser     545                 5 - #50                 5 - #55                 5 -     #60     - Cys Arg Ala Asp Ser Thr Ser Thr Thr Asp Cy - #s Ser Gly Arg Gly His     #               575     - Cys Cys Val Gly Ala Cys Glu Cys His Lys Ar - #g Pro Asn Pro Ile Glu     #           590     - Ile Ile Ser Gly Lys His Cys Glu Cys Asp As - #n Phe Ser Cys Glu Arg     #       605     - Asn Arg Asn Gln Leu Cys Ser Gly Pro Asp Hi - #s Gly Thr Cys Glu Cys     #   620     - Gly Arg Cys Lys Cys Lys Pro Gly Trp Thr Gl - #y Ser Asn Cys Gly Cys     625                 6 - #30                 6 - #35                 6 -     #40     - Gln Glu Ser Asn Asp Thr Cys Met Pro Pro Gl - #y Gly Gly Glu Ile Cys     #               655     - Ser Gly His Gly Thr Cys Glu Cys Gly Val Cy - #s Lys Cys Thr Val Asn     #           670     - Asp Gln Gly Arg Phe Ser Gly Arg His Cys Gl - #u Lys Cys Pro Thr Cys     #       685     - Ser Gly Arg Cys Gln Glu Leu Lys Asp Cys Va - #l Gln Cys Gln Met Tyr     #   700     - Lys Thr Gly Glu Leu Lys Asn Gly Asp Asp Cy - #s Ala Arg Asn Cys Thr     705                 7 - #10                 7 - #15                 7 -     #20     - Gln Phe Val Pro Val Gly Val Glu Lys Val Gl - #u Ile Asp Glu Thr Lys     #               735     - Asp Glu Gln Met Cys Lys Phe Phe Asp Glu As - #p Asp Cys Lys Phe Met     #           750     - Phe Lys Tyr Ser Glu Gln Gly Glu Leu His Va - #l Tyr Ala Gln Glu Asn     #       765     - Lys Glu Cys Pro Ala Lys Val Phe Met Leu Gl - #y Ile Val Met Gly Val     #   780     - Ile Ala Ala Ile Val Leu Val Gly Leu Ala Il - #e Leu Leu Leu Trp Lys     785                 7 - #90                 7 - #95                 8 -     #00     - Leu Leu Thr Thr Ile His Asp Arg Arg Glu Ph - #e Ala Arg Phe Glu Lys     #               815     - Glu Arg Met Asn Ala Lys Trp Asp Thr Gly Gl - #u Asn Pro Ile Tyr Lys     #           830     - Gln Ala Thr Ser Thr Phe Lys Asn Pro Met Ty - #r Ala Gly Lys     #       845     - (2) INFORMATION FOR SEQ ID NO:34:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 282 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: double               (D) TOPOLOGY: linear     -     (ix) FEATURE:               (A) NAME/KEY: CDS               (B) LOCATION: 1..282     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:34:     - TGT GTT TGT AGG AAG AGG GAT AAT ACA AAT GA - #A ATT TAT TCT GGC AAA       48     Cys Val Cys Arg Lys Arg Asp Asn Thr Asn Gl - #u Ile Tyr Ser Gly Lys     #                 15     - TTC TGC GAG TGT GAT AAT TTC AAC TGT GAT AG - #A TCC AAT GGC TTA ATT       96     Phe Cys Glu Cys Asp Asn Phe Asn Cys Asp Ar - #g Ser Asn Gly Leu Ile     #             30     - TGT GGA GGA AAT GGT GTT TGC AAG TGT CGT GT - #G TGT GAG TGC AAC CCC      144     Cys Gly Gly Asn Gly Val Cys Lys Cys Arg Va - #l Cys Glu Cys Asn Pro     #         45     - AAC TAC ACT GGC AGT GCA TGT GAC TGT TCT TT - #G GAT ACT AGT ACT TGT      192     Asn Tyr Thr Gly Ser Ala Cys Asp Cys Ser Le - #u Asp Thr Ser Thr Cys     #     60     - GAA GCC AGC AAC GGA CAG ATC TGC AAT GGC CG - #G GGC ATC TGC GAG TGT      240     Glu Ala Ser Asn Gly Gln Ile Cys Asn Gly Ar - #g Gly Ile Cys Glu Cys     # 80     - GGT GTC TGT AAG TGT ACA GAT CCG AAG TTT CA - #A GGG CAA ACG     # 282     Gly Val Cys Lys Cys Thr Asp Pro Lys Phe Gl - #n Gly Gln Thr     #                 90     - (2) INFORMATION FOR SEQ ID NO:35:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 94 amino               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:35:     - Cys Val Cys Arg Lys Arg Asp Asn Thr Asn Gl - #u Ile Tyr Ser Gly Lys     #                 15     - Phe Cys Glu Cys Asp Asn Phe Asn Cys Asp Ar - #g Ser Asn Gly Leu Ile     #             30     - Cys Gly Gly Asn Gly Val Cys Lys Cys Arg Va - #l Cys Glu Cys Asn Pro     #         45     - Asn Tyr Thr Gly Ser Ala Cys Asp Cys Ser Le - #u Asp Thr Ser Thr Cys     #     60     - Glu Ala Ser Asn Gly Gln Ile Cys Asn Gly Ar - #g Gly Ile Cys Glu Cys     # 80     - Gly Val Cys Lys Cys Thr Asp Pro Lys Phe Gl - #n Gly Gln Thr     #                 90     - (2) INFORMATION FOR SEQ ID NO:36:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 282 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: double               (D) TOPOLOGY: linear     -     (ix) FEATURE:               (A) NAME/KEY: CDS               (B) LOCATION: 1..282     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:36:     - TGC GTG TGC AGG AAG AGG GAC AAC ACC AAC GA - #G ATC TAC TCG GGC AAA       48     Cys Val Cys Arg Lys Arg Asp Asn Thr Asn Gl - #u Ile Tyr Ser Gly Lys     #                 15     - TTC TGC GAG TGC GAC AAC TTC AAC TGT GAT CG - #G TCC AAT GGC TTA ATC       96     Phe Cys Glu Cys Asp Asn Phe Asn Cys Asp Ar - #g Ser Asn Gly Leu Ile     #             30     - TGT GGA GGC AAT GGA GTG TGC CGG TGT CGT GT - #G TGC GAG TGC TTC CCC      144     Cys Gly Gly Asn Gly Val Cys Arg Cys Arg Va - #l Cys Glu Cys Phe Pro     #         45     - AAC TAC ACC GGC AGC GCC TGT GAC TGC TCT CT - #G GAC ACT GCG CCG TGC      192     Asn Tyr Thr Gly Ser Ala Cys Asp Cys Ser Le - #u Asp Thr Ala Pro Cys     #     60     - CTG GCC ACC AAC GGG CAG ATC TGC AAT GGC CG - #G GGT GTG TGC GAG TGC      240     Leu Ala Thr Asn Gly Gln Ile Cys Asn Gly Ar - #g Gly Val Cys Glu Cys     # 80     - GGC GTG TGC AAG TGC ACG GAC CCC AAG TTC CA - #G GGG CAG ACC     # 282     Gly Val Cys Lys Cys Thr Asp Pro Lys Phe Gl - #n Gly Gln Thr     #                 90     - (2) INFORMATION FOR SEQ ID NO:37:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 94 amino               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:37:     - Cys Val Cys Arg Lys Arg Asp Asn Thr Asn Gl - #u Ile Tyr Ser Gly Lys     #                 15     - Phe Cys Glu Cys Asp Asn Phe Asn Cys Asp Ar - #g Ser Asn Gly Leu Ile     #             30     - Cys Gly Gly Asn Gly Val Cys Arg Cys Arg Va - #l Cys Glu Cys Phe Pro     #         45     - Asn Tyr Thr Gly Ser Ala Cys Asp Cys Ser Le - #u Asp Thr Ala Pro Cys     #     60     - Leu Ala Thr Asn Gly Gln Ile Cys Asn Gly Ar - #g Gly Val Cys Glu Cys     # 80     - Gly Val Cys Lys Cys Thr Asp Pro Lys Phe Gl - #n Gly Gln Thr     #                 90     - (2) INFORMATION FOR SEQ ID NO:38:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 276 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: double               (D) TOPOLOGY: linear     -     (ix) FEATURE:               (A) NAME/KEY: CDS               (B) LOCATION: 1..276     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:38:     - TGT GTC TGC CAC AGC AGT GAC TTT GGC AAG AT - #C ACG GGC AAG TAC TGC       48     Cys Val Cys His Ser Ser Asp Phe Gly Lys Il - #e Thr Gly Lys Tyr Cys     #                 15     - GAG TGT GAC GAC TTC TCC TGT GTC CGC TAC AA - #G GGG GAG ATG TGC TCA       96     Glu Cys Asp Asp Phe Ser Cys Val Arg Tyr Ly - #s Gly Glu Met Cys Ser     #             30     - GGC CAT GGC CAG TGC AGC TGT GGG GAC TGC CT - #G TGT GAC TCC GAC TGG      144     Gly His Gly Gln Cys Ser Cys Gly Asp Cys Le - #u Cys Asp Ser Asp Trp     #         45     - ACC GGC TAC TAC TGC AAC TGT ACC ACG CGT AC - #T GAC ACC TGC ATG TCC      192     Thr Gly Tyr Tyr Cys Asn Cys Thr Thr Arg Th - #r Asp Thr Cys Met Ser     #     60     - AGC AAT GGG CTG CTG TGC AGC GGC CGC GGC AA - #G TGT GAA TGT GGC AGC      240     Ser Asn Gly Leu Leu Cys Ser Gly Arg Gly Ly - #s Cys Glu Cys Gly Ser     # 80     #      276TGT ATC CAG CCG GGC TCC TAT GGG GA - #C ACC     Cys Val Cys Ile Gln Pro Gly Ser Tyr Gly As - #p Thr     #                 90     - (2) INFORMATION FOR SEQ ID NO:39:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 92 amino               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:39:     - Cys Val Cys His Ser Ser Asp Phe Gly Lys Il - #e Thr Gly Lys Tyr Cys     #                 15     - Glu Cys Asp Asp Phe Ser Cys Val Arg Tyr Ly - #s Gly Glu Met Cys Ser     #             30     - Gly His Gly Gln Cys Ser Cys Gly Asp Cys Le - #u Cys Asp Ser Asp Trp     #         45     - Thr Gly Tyr Tyr Cys Asn Cys Thr Thr Arg Th - #r Asp Thr Cys Met Ser     #     60     - Ser Asn Gly Leu Leu Cys Ser Gly Arg Gly Ly - #s Cys Glu Cys Gly Ser     # 80     - Cys Val Cys Ile Gln Pro Gly Ser Tyr Gly As - #p Thr     #                 90     - (2) INFORMATION FOR SEQ ID NO:40:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 276 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: double               (D) TOPOLOGY: linear     -     (ix) FEATURE:               (A) NAME/KEY: CDS               (B) LOCATION: 1..276     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:40:     - TGC TCC TGC CAC AGC GAT GAC TTT GGC AAG AT - #C ACG GGC AAG TAC TGT       48     Cys Ser Cys His Ser Asp Asp Phe Gly Lys Il - #e Thr Gly Lys Tyr Cys     #                 15     - GAG TGT GAT GAC TTC TCC TGT GTT CGC TAC AA - #A GGG GAG ATG TGC TCA       96     Glu Cys Asp Asp Phe Ser Cys Val Arg Tyr Ly - #s Gly Glu Met Cys Ser     #             30     - GGC CAT GGC CAG TGC AGC TGT GGG GAT TGC CT - #G TGT GAT TCT GAC TGG      144     Gly His Gly Gln Cys Ser Cys Gly Asp Cys Le - #u Cys Asp Ser Asp Trp     #         45     - ACT GGC TAC TAC TGT AAC TGT ACC ACA CTC AC - #T GAC ACC TGC ATG TCC      192     Thr Gly Tyr Tyr Cys Asn Cys Thr Thr Leu Th - #r Asp Thr Cys Met Ser     #     60     - AGC AAC GGG CTG TTG TGC AGC GGC CGG GGC AA - #G TGT GAA TGT GGC AGT      240     Ser Asn Gly Leu Leu Cys Ser Gly Arg Gly Ly - #s Cys Glu Cys Gly Ser     # 80     #      276TGC ATC CAG CCG GGA TCT TAT GGG GA - #C ACT     Cys Val Cys Ile Gln Pro Gly Ser Tyr Gly As - #p Thr     #                 90     - (2) INFORMATION FOR SEQ ID NO:41:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 92 amino               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:41:     - Cys Ser Cys His Ser Asp Asp Phe Gly Lys Il - #e Thr Gly Lys Tyr Cys     #                 15     - Glu Cys Asp Asp Phe Ser Cys Val Arg Tyr Ly - #s Gly Glu Met Cys Ser     #             30     - Gly His Gly Gln Cys Ser Cys Gly Asp Cys Le - #u Cys Asp Ser Asp Trp     #         45     - Thr Gly Tyr Tyr Cys Asn Cys Thr Thr Leu Th - #r Asp Thr Cys Met Ser     #     60     - Ser Asn Gly Leu Leu Cys Ser Gly Arg Gly Ly - #s Cys Glu Cys Gly Ser     # 80     - Cys Val Cys Ile Gln Pro Gly Ser Tyr Gly As - #p Thr     #                 90     - (2) INFORMATION FOR SEQ ID NO:42:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 15 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:42:     - Arg Gly Ser Thr Ser Thr Phe Lys Asn Val Th - #r Tyr Lys His Arg     #                15     - (2) INFORMATION FOR SEQ ID NO:43:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 15 amino               (B) TYPE: amino acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -      (xi) SEQUENCE DESCRIPTION: SEQ ID NO: - #43:     - Tyr Lys His Arg Glu Lys Gln Lys Val Asp Le - #u Ser Thr Asp Cys     #                15     __________________________________________________________________________ 

We claim:
 1. A substantially purified integrin cell surface receptor subunit comprising β₆ having the amino acid sequence set forth in FIG. 3 for human.
 2. A substantially purified integrin comprising β₆ bound to an α subunit, wherein the β₆ subunit has the amino acid sequence set forth in FIG. 3 for human.
 3. The integrin of claim 2, wherein the subunit is a α_(v). 